Human immunoglobulin VH gene segments and DNA fragments containing the same

ABSTRACT

Novel human immunoglobulin V H  segments and DNA fragments containing the same are disclosed. The DNA fragment according to the present invention is the fragment having a size of about 800 kbp which is shown in FIG. 1. The human immunoglobulin V H  segments according to the present invention are contained in the fragment of this DNA fragment of about 800 kbp, and there are 50 novel segments. The base sequences of these :segments are shown in the Sequence Listing. The present invention also provides DNA fragments which contain two or more of these V H  segments.

This is a 371 national stage filing of International Application No. PCT/JP93/00603, with an international filing data of May 10, 1993, now abandoned.

TECHNICAL FIELD

This invention relates to novel human immunoglobulin V_(H) gene segments and DNA fragments containing the same. The segments and DNA fragments according to this present invention are useful for producing human antibodies using a mammalian host by a genetic engineering process.

BACKGROUND ART

Immunoglobulins are composed of the L chains and H chains, each of which consists of a variable region (V region) and a constant region (C region) that has a structure common to immunoglobulin molecules. What determines the antigenic specificity of an antibody is the V region. The V region of the H chain is encoded by V, D (diversity) and J (joining) genes (The gene of the H chain is expressed by placing a suffix "H", like "V_(H) "). One of the important reasons why the V regions of immunoglobulins are highly diverse and can provide antibodies which specifically binds to infinite number of antigens is the rearrangement of V, D and J genes. That is, there are a plurality of V genes, D genes and J genes, respectively and they are randomly combined in somatic cells to form a gene encoding a single mRNA. Since the combination is randomly selected, side variety of immunoglobulin V regions are provided.

On the other hand, antibodies currently employed for therapies of various diseases are those originated from animals other than human, such as mouse. However, if these antibodies are administered to human, since the antibodies are of exogenous origin, an immunological response occurs in the human body to present allergy and to neutralize the antibodies. To overcome this problem, it is desired to use antibodies originated from human for the therapies for human. Further, if a human antibody is industrially produced using human as the host and using a human-originated antigen, a problem of immunological tolerance is brought about, so that this approach employing the known method is very difficult. Thus, the production of human immunoglobulins by a genetic engineering process using an animal as a host is now being developed (for example, Japanese Laid-open PCT Application (Kohyo) No. 4-504365; Proc. Natl. Acad. Sci. USA, Vol. 86, pp.5898-5902, August 1989; Proc. Natl. Acad. Sci. USA, Vol. 87, pp.5109-5113, July 1990; Genomics 8, 742-750 (1991)). However, in the conventional methods in which human immunoglobulin genes are expressed in host animals other than human, there is a problem that the number of human V_(H) segments provided for the genetic recombination is very small, so that the diversity of the expressed human immunoglobulins is limited. Even if only one V_(H) segment is recombined, the diversity of the immunoglobulin is assured to some degree because of the combination with D and J genes. However, as mentioned above, since the diversity of immunoglobulins is determined by the rearrangement (random combination) of V gene segments, the more the human V_(H) segments recombined, the higher the diversity of the immunoglobulins expressed. If the diversity of immunoglobulins is increased, not only antibodies against a number of antigens can be formed, but also the possibility of forming an antibody having a high specificity to a given antigen is promoted. Therefore, it is important for therapies and diagnoses to recombine V_(H) segments as many as possible.

DISCLOSURE OF THE INVENTION

Accordingly, an object of the present invention is to provide a DNA fragment comprising a plurality of human immunoglobulin V_(H) segments. Another object of the present invention is to provide a novel human immunoglobulin V_(H) segments.

The present inventors intensively studied to succeed in determining human immunoglobulin H chain V region gene segments having a size of about 800 kb and in determining DNA sequences of 64 human V_(H) segments contained therein. This made it possible to provide this DNA fragment of 800 kb and various DNA fragments contained therein, thereby completing the present invention.

That is, the present invention provides a DNA fragment having a size of about 800 kbp and having the structure shown in FIG. 1. It should be noted that in FIG. 1, the 64 human V_(H) segments are those having DNA sequences shown in Sequence ID Nos. 1, 2, . . . 63, and 64, respectively, in the order from downstream (i.e., from the side near the J_(H) gene).

The present invention also provides DNA fragments containing at least two consecutive functional human V_(H) segments which are contained in said DNA fragment of about 800 kb according to the present invention.

The present invention further provides DNA fragments Y20, Y103, Y21, Y6, Y-24, M131, M118, M84 and 3-31, which have been deposited.

The present invention still further provides DNA fragments consisting essentially of at least two optional DNA fragments linked in an optional order, each of which contains at least two consecutive functional human V_(H) segments contained in the DNA fragment of about 800 kb according to the present invention.

The present invention still further provides DNA fragments consisting essentially of at least two DNA fragments selected from the group consisting of DNA fragments Y20, Y103, Y21, Y6, Y-24, M131, M118, M84 and 3-31 which have been deposited, which are linked in an optional order.

The present invention still further provides novel human immunoglobulin V_(H) segments having DNA sequences shown in Sequence ID Nos. 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 63 and 64, respectively.

By the present invention, novel human immunoglobulin V_(H) segments and DNA fragments containing the same were provided. The DNA fragment of about 800 kb according to the present invention contains as many as 64 human immunoglobulin V_(H) segments. Thus, by producing human immunoglobulins by a host animal using this DNA fragment, the diversity of the produced human immunoglobulin is largely increased when compared with the conventional methods.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a genetic map of the DNA fragment of about 0.8 Mb according to the present invention.

FIG. 2 shows the results of Southern hybridization of a representative DNA inserted in YAC.

FIG. 3A shows the results of Southern hybridization of the fragment digested with restriction enzymes Mlu I and Not I.

FIG. 3B shows a physical map of a YAC clone constructed based on the results shown in FIG. 3A.

FIG. 4 shows a genetic map of YAC clone Y6.

BEST MODE FOR CARRYING OUT THE INVENTION

The present inventors prepared a library by inserting the DNA partially digested with Ecco RI into YAC by the method detailed in the examples hereinbelow described, which DNA was originated from human lymphoblastoid cell line transformed by EB virus, and succeeded in determining the structure of human V_(H) gene region having a size of about 800 kbp using the above-mentioned library. The structure is shown in FIG. 1. In FIG. 1, the genetic map is shown on the four thick solid lines. The right side of each solid line is the 3' side and the left end of the upper most solid line continues to the right end of the second solid line. In the DNA fragment shown in FIG. 1, there exist C genes, J_(H) genes and D genes in the order mentioned from the 3' end. Subsequent to the D genes, there are 64 V_(H) segments. The DNA sequences of all of these 64 V_(H) segments have been determined as described in the examples below, and Sequence ID Nos. 1, 2, . . . 63, 64 were assigned to the 64 V_(H) segments in the order from downstream. Among these V_(H) segments, the functional V_(H) segments which are thought to encode polypeptides are indicated by solid rectangles. On the other hand, those which have the general features of the known V_(H) segments but do not presently encode polypeptides because of the termination codons; contained therein, that is, pseudo V_(H) segments are indicated by hollow rectangles. Immediately below the genetic map, restriction maps by Eco RI and Hind III are shown. The restriction sites are indicated by short perpendicular lines. The short lines to which ends circles are attached are those whose order is not determined, and the dotted boxes indicate the regions in which Eco RI sites have not been determined. In FIG. 1, the symbol which looks like "Y" indicates the sites at which two restriction sites are close. In FIG. 1, restriction sites of Mlu I are indicated by hollow triangles and restriction sites of Not I are indicated by solid triangles. The fragments inserted in the clones employed for determining the structure of the DNA fragment are shown thereunder. The structure of the 3' side farther than the 3' end shown in FIG. 1 is known and described in Ravetch, J. V. et al., (1981) Cell, Vol. 27, pp.583-591.

Among the DNA fragments inserted in the clones shown in FIG. 1, the yeasts containing Y20, Y103, Y21, Y6 and Y24 inserted in YAC have been deposited with National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology at 1-3, Higashi 1-chrome Tsukuba-shi. Ibaraki, 305 JAPAN on Apr. 22, 1993 under accession numbers FERM BP-4272, FERM BP-4275, FERM BP-4273, FERN4 BP-4271 and FERM BP-4274, respectively. The E. coli cells containing M131, M118, M84 and 3-31, respectively, inserted in cosmids have been deposited with National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology at 1-3, Higashi 1-chrome Tsukuba-shi, Ibaraki, 305 JAPAN on Apr. 22, 1993 under accession numbers FERM BP-4279, FERM BP-4278, FERM BP-4277 and FERM BP-4276 respectively.

The DNA fragment having a size of about 800 kbp shown in FIG. 1 can be prepared by linking these deposited DNA fragments by known methods. That is, a DNA fragment A and a DNA fragment B whose DNA sequence at its terminal region overlaps with the DNA sequence of the terminal region of DNA fragment A (i.e., the DNA sequence of the 3' region of DNA fragment A is identical to the DNA sequence of the 5' region of DNA fragment B) can be easily ligated by a method exploiting genetic recombination in the yeast cells. More particularly, DNA fragments A and B are inserted in separate YAC vectors, and the resulting recombinant YAC vectors are introduced in separate mating type yeast cells, respectively. The resulting yeast cells are then fused. By this, genetic recombination occurs in the yeast host to form a YAC having a DNA fragment in which DNA fragment A and DNA fragment B are ligated, which has only one overlapping region located at the terminal regions of DNA fragments A and B. The thus formed recombinant YAC can easily be selected using the auxotrophy encoded in the YAC as a marker. This method is well-known in the art, and is described in, for example, Japanese Laid-open PCT Application (Kohyo) No. 4-504365; Proc. Natl. Acad. Sci. USA, Vol. 87, pp.9913-9917, December 1990; Science Vol. 250, p.94, Proc. Natl. Acad. Sci. USA, Vol. 89, pp.5296-5300, June 1992; and Nucleic Acid Research, Vol. 20, No. 12, pp.3135-3138. Since the terminal regions of each of the deposited 8 DNA fragments overlap the respective terminal regions of the adjacent DNA fragments, they can be ligated sequentially by the method described above. Although DNA fragments 3-31, M84, M118 and M-131 are cloned in cosmid vectors, they can be kept in an artificial chromosome in the yeast cell by cutting the recombinant cosmid with a restriction enzyme having a restriction site only in the cosmid vector, and ligating a YAC vector to the ends of the digested recombinant cosmid vector. Further, by the above-described method, the digested recombinant vector can be ligated to a YAC clone of other regions. It should be noted that even if the above-mentioned 9 deposited fragments are ligated, a gap of about 4 kb still remains. A DNA fragment which fills the gap can be easily prepared by the method described below. That is, as shown in FIG. 1, since the Hind III fragment including the region of the gap is relatively large, this Hind III fragment can be obtained by completely digesting human genome by Hind III, electrophoresing the resultant, selecting DNA fragments having sizes of about 15 kb, detecting the desired fragment with a probe, and recovering the detected desired fragment. The probe used here can be isolated as follows. That is, the DNA fragments located at the both ends of the gap are subcloned using a plasmid and DNA fragments which do not contain a repetitive sequence are prepared therefrom. The thus obtained fragments are then used for screening of the library. Only those detected by the probes which are the DNA fragments at both ends of the gap are isolated.

As described above, the DNA fragment of about 800 kbp shown in FIG. 1 was provided according to the present invention. The fragments consisting of the DNA region included in this DNA fragment can also be used for producing human immunoglobulin by a genetic engineering method. More particularly, to increase the diversity of human immunoglobulin produced by a genetic engineering method, it is preferred to incorporate a fragment containing human V_(H) segments as many as possible. However, if the fragment contains at least two human V_(H) segments, the diversity to some degree is given during rearrangement, so that the fragment can be employed. Thus, DNA fragments consisting of a region containing at least two consecutive functional V_(H) segments, which region is contained in the DNA of about 800 kb shown in FIG. 1 can be employed and are useful. The number of the functional V_(H) segments contained in such DNA fragments is at least two, and is preferably not less than 6. The more the number of the functional V_(H) segments, the higher the diversity of the human immunoglobulin produced, so that the more preferred. Thus, the preferableness is increased when the number of the functional V_(H) segments is 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32 and 33, with the order mentioned. Among these fragments, although those having large molecular weights are cloned into YAC vector, small fragments having a size of about not more than 50 kb are not necessarily cloned into YAC vector, but can be cloned into cosmid vectors and plasmid vectors.

Such DNA fragments can be prepared since the information disclosed in FIG. 1 and Sequence ID Nos. 1-64 is available. That is, for example, a DNA fragment containing not less than two functional V_(H) segments can be obtained by partially digesting human genome with an appropriate restriction enzyme such as Eco RI or Hind III, separating the resulting fragments by electrophoresis, and selecting a DNA fragment containing not less than two desired functional V_(H) segments using not less than two probes each of which hybridizes with one of the not less than two desired functional V_(H) segments. Alternatively, amplification by PCR may be employed in place of the detection by the probes. In this case, since the entire DNA sequences of the functional V_(H) segments are known, the DNA sequences of the primers which should be used are also known, so that the PCR can be carried out easily.

The present invention further provides DNA fragments consisting essentially of optional DNA fragments each of which contains not less than two functional V_(H) segments which are ligated in optional orders. That is, by ligating a plurality of the DNA fragments each containing not less than two functional V_(H) segments, the number of V_(H) segments in the DNA fragment can be increased when compared with the case where only one such DNA fragment containing not less than two V_(H) segments is used, so that the diversity of the produced immunoglobulin can be increased accordingly. The DNA fragments are not necessarily consecutive, and optional DNA fragments may be ligated in an optional order. In cases where there is no overlapping region between two DNA fragments to be ligated, the above-described method for ligating the DNA fragments having an overlapping region cannot be applied. However, two DNA fragments having no overlapping region can also be ligated by the method as follows.

The left arm vector region and the right arm vector region of a YAC clone containing not less than two functional V_(H) segments are recovered by the method of Hermanson et al (1991) (Nucleic Acids. Res.,19; 4943-4948). A plasmid (pICL) which has a sequence homologous with the ampicillin-resistant marker (AMP) in the left arm vector region of the YAC, a marker (Lys) which reverse the lysine auxotrophy to the wild type, and a multiple cloning site immediately downstream Lys; and a plasmid (pLUS) which has a sequence homologous with YAC4 region in the right arm vector region of the YAC, the above-mentioned Lys, a kanamycin-resistant marker (KAN), and a multiple cloning site immediately downstream the KAN are linearized and then introduced into yeast cells containing YAC by a conventional method. The plasmids pICL and pLUS cause recombination in the yeast cells at an appropriate frequency, thereby being recombined with the left arm vector region and the right arm vector region of the YAC. The yeast cells carrying such a YAC are selected by using an appropriate selection medium and the YAC in the selected yeast cells is then cut with an appropriate restriction enzyme which has a restriction sites in the multiple cloning sites of the above-mentioned plasmids. By the operation described above, DNA fragments containing the left end or the right end of the DNA fragment originated from human contained in the YAC are recovered as plasmids. After amplifying the thus obtained plasmids in E. coli by a conventional method, the recovered plasmids are digested with a restriction enzyme and then ligated by ligase. The thus ligated DNA fragment is then ligated to the left arm vector region or the right arm vector region of the YAC and introduced into yeast cells carrying the YAC. These YAC vectors causes recombination at a certain frequency between the intrinsic left arm or right arm vector regions and the left end or right end region of the DNA fragment originated from human. By selecting the resulting recombinant vectors, a YAC clone containing a DNA fragment originated from human, which left end is ligated to the right end of another DNA fragment originated from human, and a YAC clone containing a DNA fragment originated from human, which right end is ligated to the left end of another DNA fragment originated from human are recovered. Since these YAC clones have the structure in which the left end or the right end of a DNA originated from human is ligated to the right end or the left end of another DNA originated from human, they can be recombined with a YAC clone having a sequence in the ligated DNA fragments by the method described above.

Further, by optionally ligating the above-described eight actually deposited DNA fragments in an optional order, a large fragment containing a number of V_(H) segments can be prepared.

By the present invention, the DNA sequencers of the 64 V_(H) segments contained in the fragment of about 800 kbp shown in FIG. 1 were determined. Amino acid sequences encoded by the DNA of SEQ. ID NOS: 1-13, 15-18, 20-28, 30-31, 33-39, 41, 43-56, and 58-64 are provided respectively in SEQ ID NOS: 89-145. As described in detail in the examples below, among these, 50 V_(H) segments are novel segments which have DNA sequences that have not hitherto been known. These novel human immunoglobulin V_(H) segments include pseudo segments which do not encode a polypeptide. Even a pseudo segment has an utility because it may function as a donor of gene conversion in the somatic cell level.

The human immunoglobulin V_(H) segments and the DNA fragments containing the same according to the present invention can be used for producing human immunoglobulins in a mammalian host as described in, for example, Japanese Laid-open PCT Application (Kohyo) No. 4-504365.

EXAMPLES

The present invention will now be described in more detail by way of examples thereof. It should be noted that the present invention is not limited to the following examples.

Example 1 Determination of Structure of DNA Fragment of About 800 kbp

(1) Library Used for Screening

The human YAC library screened was constructed from DNA of an Epstein-Barr virus-transformed human lymphoblastoid cell line CGM1 (T. Imai and M. V. Olson, genomics, 8, 297-303 (1990)). Eco RI partial digests of CGM1 DNA were ligated to pYAC4 vector (D. Burkea and M. V. Olson, in "Guide to Yeast Genetics and Molecular Biology" (C. Guthrie and G. R. Fink, eds), p.253, Academic Press, Orlando, 1991), and introduced into AB1380 yeast host strain (D. Burke and M. V. Olson, in "Guide to Yeast Genetics and Molecular Biology" (C. Guthrie and G. R. Fink, eds), p.253, Academic Press, Orlando, 1991). The library consisted of 15,000 independent clones with mean YAC size of about 360 kb. The library thus contained the equivalent of approximately 1.8 haploid human genomes. DNA rearrangement in immunoglobulin H chain (IgH) locus was first checked by Southern hybridization using the human D and J_(H) probes. The result showed that an allele kept germline configuration while the other was VDJ rearranged.

(2) Primers Used for PCR-Based Screening

For PCR-based screening of human V_(H) YAC clones, oligonucleotide primers for V_(H-III) and V_(H-I) families, the first and the second largest V_(H) families, were synthesized. V_(H) region segments of immunoglobulins contain two hypervariable regions (CDR1 and CDR2) and three framework regions (FR1, FR2 and FR3) (E. A. Kabat et al., Sequences of Proteins of Immunological Interest, Fifth edition, NIH publications, Washington D.C. (1991)). Nucleotide sequences of the framework regions are highly conserved within the same family, suggesting the possibility of oligonucleotide synthesis is for consensus primers corresponding to the framework regions. For this purpose, nucleotide sequences of FR1, FR2 and FR3 regions in all the known V_(H) sequences were aligned for comparison. Nucleotide sequences corresponding to the first 8 amino acid residues of the FR1 region had extremely high conservation not only within the same family but also between V_(H-I) and V_(H-III) families, which enabled the synthesis of a forward primer F-univ common for the two families as shown in Table 1. Sequences for family-specific reverse primers were independently chosen from conserved sequences in the FR2 region so that 3'-half of the primer sequence has 100% identity to known V_(H) segments and, in particular, 3'-most nucleotide corresponds to the first letter of the highly conserved/invariant amino acid residues. More particularly, F-univ and I-R, and F-univ and III-R were used as primers for the screening. The DNA sequences of the primers are shown in Table 1.

(3) Optimal PCR Condition Check

Analytical experiments were carried out to determine the optimal condition for specific amplification. A reaction mixture (5 μl) was prepared in accordance with the protocol recommended by Perkin-Elmer/Cetus. Thermal cycling was performed using a DNA Thermal Cycler (Perkin-Elmer/Cetus). Reactions were carried cut using 25 ng of template human DNA under various annealing temperatures (55° C., 58° C., 60° C. and 62° C.) and cyrcles (25, 30, and 35 cycles). As a result, it was found that the reaction under high annealing temperature, namely 94° C., 1 minute--62° C., 2 minutes --72° C., 2 minutes, regardless of cycles, produced specific amplification in human DNA sample but not in yeast strain AB1380 DNA. PCR under low annealing temperature sometimes gave false positive signals in negative control and therefore could not be used. Thus, the PCR was carried out under the above-described conditions.

(4) Polymerase Chain Reaction (PCR)

PCR-based first screening was performed Using synthesized oligonucleotide primers described above against seven multi-filter DNA pools each of which represents the DNA from 1920 colonies (20×96-well) as described (E. D. Green and M. V. Olson, Proc. Natl. Acad. Sci. USA, 87, 1213-1217 (1990)). Positive multi-filter pools were divided into five pools each of which consists of 384 colonies (4×96-well), and further screened by the same procedure. 25 ng each of YAC pool DNAs were used for reaction. DNA of CGM1 whose DNA was used to construct the YAC library, and of the yeast strain AB1380 were included during the PCR analysis as positive and negative controls, respectively. After the amplification, the entire sample was analyzed by electrophoresis in 10% polyacrylamide gels containing 15% glycerol and visualized by ethidium bromide staining.

(5) Colony Hybridization

After PCR-based first and second screening, the location of the positive clone within the 384-clone array was established by conventional colony hybridization. The nylon filters consisting of 384 YAC clones were prepared by a known method (D. Burke and M. V. Olson, in "Guide to Yeast Genetics and Molecular Biology" (C. Guthrie and G. R. Fink, eds), p.253, Academic Press, Orlando, 1991). V_(266BL) (Y. Nishida, et al., Proc. Natl. Acad. Sci. USA, 79, 3833-3837 (1992)) and V_(HBV) (M. Kodaira et al., J. Mol. Biol., 190, 529-541 (1986)) were used for probes representative for human V_(H-I) and V_(H-IIII) families, respectively. These probes were labeled (5×10⁵ cpm) with ³² P-dCTP using Oligolabeling Kit. (Pharmacia) and subjected to colony hybridization according to standard procedure (D. Burke, et al., supra). After the hybridization for 12 hours at 65° C., filters were washed twice with 2×SSC (1×SSC is 0.15 M NaCl-15 mM sodium citrate) for 10 minutes at room temperature, then twice with 0.2×SSC-0.1% SDS for 30 minutes at 65° C. Filters were exposed overnight and corresponding positive YAC clones were picked up for further characterization.

(6) Insert Check by Colony PCR

To test the presence of specific DNA sequence in isolated YACs, simple and rapid rescreening of colony-purified clones was carried out by using PCR without DNA purification (E. D. Green and M. V. Olson, Proc. Natl. Acad. Sci. USA, 87, 1213-1217 (1990)). That is, the positive yeast clones were streaked onto AHC plates and grown. Four each of single colonies from each clone were transferred by toothpick into 5 μl of PCR mixture described above. PCR and following gel electrophoresis were performed for identification of the amplified bands under the same condition as that used for screening. In most of the clones, all of the four colonies gave rise to specific amplification of DNA fragments.

(7) Sizing of YAC Clones Using PFGE

Many researchers claimed that some YACs are clonally unstable due to intrachromosomal rearrangement during the growth in culture resulting in size variation of the human DNA insert. This artifact is considered to be often mediated by repetitive sequences or tandem repeat of homologous DNA sequences in the insert DNA. Since V_(H) locus contains a number of homologous DNA fragments consisting of V_(H) gene segments and their flanking regions, such kind of rearrangement can take place at considerable frequency. An additional problem is the presence of single yeast containing more than one insert YACs. In order to exclude the artifact clones for subsequent analysis and to identify YAC clones with multiple insert, the sizes of the YAC clones were first determined by pulse field gel electrophoresis (PFGE). The same four V_(H) -positive single colonies checked by PCR were selected from 17 colonies originating from a single well, and miniprepared from 5 ml culture in AHC medium to give low-gelling temperature agarose blocks by a known method (D. Burke et al., supra). Appropriate sized piece of agarose block was used for sizing the YACs by PFGE with a Pulsaphor (Pharmacia) or a Crossfield (ATTO, Tokyo, Japan) gel electrophoresis apparatus at 60 second pulse time. Concatamerized lambda DNA was also loaded as a size standard. After the electrophoresis, DNAs were transferred to nitrocellulose filter and subjected to Southern hybridization using pBR322 plasmid as a probe. Typical result is shown in FIG. 2. All of the four colonies selected from each of clones Y21, Y22 and Y24 having DNA inserts with a size of 300 kb, 330 kb and 310 kb, respectively exhibited the same size, so that they seemed to have no recombination. On the other hand, since four colonies selected from clone Y23 held DNA inserts with different sizes, the insert of the clone Y23 looked rather unstable due to frequent recombination. Therefore, the colony which did not cause recombination was selected for the subsequent analysis. All but 3 clones including clone Y23 of 17 V_(H) -carrying YAC clones including the analyzed V_(H) displayed instability of human inserts. Subsequent analysis revealed that such recombination took place regardless of the number of V_(H) segments in the insert DNA, indicating some other factors might be involved in homologous recombination. From 14 stable YAC clones among the 17 YAC clones containing V_(H), Y20, Y103, Y21, Y6 and Y24 were selected and used for the subsequent physical mapping.

(8) Physical Mapping of YAC Clones with Rare Site Endonucleases

Gel blocks were prepared from the YAC clones after sizing and were used for physical map construction by PFGE. In general, detailed physical map using several enzymes might be required for long-range YAC analysis. In this example, however, only two rare-site restriction enzymes (i.e., restriction enzymes whose restriction sites occur relatively rarely), namely Not I and Mlu I, were used for overlapping analysis of the YAC clones mainly by the following two reasons: 1) V_(H) -carrying YAC clones can be arrayed with several other information such as comparison of the size or the pattern of the fragments hybridized with V_(H) probes or non-repetitive probes isolated from V_(H) -carrying cosmid clones, 2) it is necessary to subclone the YACs into cosmids for detailed structural analysis including construction of physical maps using ordinary restriction enzymes.

Gel blocks digested in completion with Not I or Mlu I were electrophoresed with a PFGE apparatus using a pulse time of 30 to 60 seconds depending on the length of YAC. Mixtures of lambda phage DNA, its Xho I digests and Hind III digests were also used as low molecular weight size markers. Southern filters were first hybridized with total human large molecular DNAs for detection of all restricted fragments. The sizes of detected bands were summed up to fit the length of undigested YAC insert. Filters were hybridized consecutively with pBR322 DNA probes corresponding to each of the pYAC4 arms. A Pvu II and Bam HI double digest of pER322 results in a 2.67-kb and 1.69-kb fragments which hybridize specifically to the left (trp) and the right (ura) end of YACs, respectively. Filters were also hybridized with six V_(H) family-specific probes for the presence of V_(H) segments in digested DNA fragments. Origin of V_(H) family-specific probes for V_(H-II), V_(H-IV), V_(H-V) and V_(H-VI) families, respectively, are; V_(CE-1) (N. Takahashi et al., Proc. Natl. Acad. Sci. USA, 81, 5194-5198 (1984)), V₇₁₋₂ (K.H. Lee et al., J. Mol. Biol., 195, 761-768 (1987)), 5-1R1 (J. E. Berman et al., EMBO J. 7, 727-1051 (1988) and 6-1R1 (J. E. Berman et al., EMBO J. 7, 727-1051 (1988)).

In order to array Not I and Mlu I fragments detected by the complete digestion experiments, hybridization experiments using partially digested YAC DNA were carried out. Analytical experiment was necessary to determine the optimal condition for partial digestion since the efficiency of the restriction enzyme reaction is highly dependent on the purity of DNA. In the DNA preparation in this example, 6-hour incubation with 1 unit of restriction enzyme was, in most cases, sufficient for complete digestion of a gel block (about 500 ng of DNA). Partial cleavage of DNA was achieved by varying the time of digestion as follows:

1. Dialyze three gel blocks (about 50 μl each volume containing about 1 μg of DNA, stored in 0.5 M EDTA (pH 8.0)) for 1 hour against 50 ml of distilled water at room temperature with gentle agitation. Repeat this step for complete removal of EDTA.

2. Equilibrate the blocks with 10 ml appropriate digestion buffer at 37° C. for 30 minutes.

3. Transfer each block to 250-μl reaction mixture containing 1 unit each of restriction enzyme in 1× digestion buffer.

4. Incubate all three tubes for 10 minutes, 30 minutes and 1 hour at 37° C.

5. Stop the reaction by adding 100 μl of 0.5 M EDTA (pH 8.0).

6. Equilibrate the blocks with appropriate gel electrophoresis buffer 2-3 times over a 1 hour period and immediately perform PFGE using an appropriate pulse time.

Filters were hybridized with the above-described right- or left-end probe of YAC vector and the size of the hybridized restriction fragments was determined by comparison with size standards (FIG. 3A). Results from complete and partial digestion experiments were combined to construct a physical map of YAC clones shown in FIG. 3B. Mapped clones were thus linked and classified into several contigs.

(9) Isolation of Insert-terminal Sequences from YACs

After isolated YAC clones were classified into several contigs based on their restriction maps, insert-terminal DNA segments were isolated from both ends of each contigs to synthesize oligonucleotide primers. As is often pointed out, considerable percentage (up to 30%) of YAC clones in libraries contain noncontiguous DNA segments spliced together resulting in "chimeric clone". Since no good strategies have been developed to exclude coligation artifact during the construction of the library, it is necessary to check this possibility with appropriate method after isolation of YAC clones. In this example, the strategy to investigate the possibility by using PCR with synthesized insert-terminal primers was taken. The reason is that the synthesized primers would be useful not only to investigate chimeric clones but also to register resulting sequences as sequence tagged sites (STS) for rescreening the YAC library by PCR. In addition, they could be used to look for overlaps between contigs which could not be found by comparison of their restriction maps.

For isolation of insert-terminal YAC segments, several different methods can be employed including more sophisticated and rapid method by inverse PCR and the Vectorette system (J. H. Riley et al., Nucleic Acids Res., 18, 2887-2890 (1990)). However, in this example, a rather classical way, that is, to subclone the fragments with plasmid or lambda phage vectors was taken. High molecular weight DNA from YAC clones was digested with restriction enzymes which have recognition sites both in right- and left-arm sequences. Gel electrophoresis was performed in a 0.7% agarose gel and Southern filter was hybridized with a 0.62-kb Hind III--Sal I fragment of pBR322 DNA (Tet^(R)) which specifically hybridizes with insert-vector boundary sequence of pYAC4 vector. The DNA fractions of interest were recovered from the gel using DE81 paper and ligated to either EMBL4 or pUC19 vector depending on the insert size. Isolated fragments with EMBL4 vector were subcloned into pUC19 vector for subsequent sequencing. The chain termination method with M13 forward or reverse primer was used for sequencing these plasmid clones. Sequences for insert-terminal primers were provided from the non-repetitive portion in the resulting sequence.

PCR experiments were achieved to investigate the above-mentioned artifact using primers at the both ends of YAC-DNA against the DNA from a human mouse somatic cell hybrid GM10479 line (Colier Institute) which carries human chromosome 14 alone in which the human IgH locus exists. DNA from CGM1 cells (source of YAC library) and Rag cells (mouse cell) were also used as positive and negative controls, respectively. PCR was carried out in 25-μl reactions according to a known method (H. S. Kim and O. Smithies, Nucleic Acids Res., 16, 8870-8903 (1988)). 200 ng each of DNA was used for the reaction. Incubations containing DNAs from GM10479, CGM1 and Rag, respectively were subjected to 35 to 40 cycles at 95° C., 1 minute--55 to 62° C., 2 minutes--72° C., 2 minutes according to the condition optimized by analytical experiment using CGM1 DNA. The YAC clones of which either of the two insert-terminal primers gave no specific amplification against GM10479 were concluded to be chimeric clones. Only one contig neither of which primers gave amplified bands was turned out to cover orphan V_(H) locus on chromosome 16.

(10) Cosmid Subcloning and Construction of Physical Maps

Isolation of large chromosomal region using YAC system is advantageous for the initial step of physical mapping. However, subsequent step to analyze large DNA fragments in YAC can be problematic since exogenous DNA inserts cannot be easily separated from yeast chromosomal DNA and fragments up to several hundred kb are difficult to handle without mechanical shearing. In order to map V_(H) segments of a large DNA fragment containing V_(H) segments, detailed restriction map using common 6bp-site restriction enzyme is necessary. For this purpose, YAC clones were subcloned into cosmids. Cosmid libraries were constructed from whole YAC DNA without previous separation of cloned DNA from host chromosome. There are two major reasons for this: 1) separation of intact insert DNA and their manipulation are difficult, 2) 4000 independent colonies are sufficient for complete coverage of YAC insert since the genome size of yeast is about 1.5×10⁴ bp, 1/200 of that of human.

In general, there are two major difficulties in the construction of cosmid libraries. The first is self-ligation of vector DNAs, resulting in generation of clones carrying no inserts of foreign DNA, and the second is insertion of more than one DNA fragments in a single vector, namely co-ligation artifact. To overcome these problems, great efforts have been made including construction of better-designed vectors with two cos sites and modified method for ligation such as partial filling of vector and insert DNAs (J. Sambrook et al., A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989)). Size fractionated insert DNA usually contains smaller DNA molecules trapped among larger molecules especially when excess amount of DNA was loaded in the preparative gel. Alkaline phosphatase treatment of insert DNA is effective in order to exclude the co-ligation between inserts but gives rise to polymerized vector DNA during the ligation step, which causes high background of empty colonies under the antibiotic selection. In this example, however, less than 5 μg of YAC DNA was sufficient for insert preparation and thus preparative gel electrophoresis was successful without contamination of smaller DNA fragments. Most of the cosmid libraries were thus constructed with minimal steps in combination with alkaline phosphatase-treated cosmid vector and partially digested DNA of exact size range for cosmid insert (from 35 kb to 45 kb).

1 Preparation of Yeast DNA Containing YAC

Since large DNA fragments are required as starting material for preparing the DNA, extraction of ENA from yeast cells with minimal shear damage is one of the most critical steps. Obviously, the best way is to manipulate DNA in-gel because DNA is fully protected from shear damage. The present inventors found, however, that gentle extraction of DNA in liquid from yeast cells gives sufficient length of DNA (>200 kb) for partial digestion and subsequent size fractionation. In addition, liquid DNA is easier to control the condition for partial digestion than gel block DNA. With a simple and rapid (6 hours for total procedure) method described below, about 50 μg of large size DNA (>200 kb) can be routinely purified from 100-ml yeast culture.

(i) Spin down yeast cells and wash them with TE (10 mM Tris HCl (pH 8.0)--1 mM EDTA) twice.

(ii) Resuspend the cells in 20 ml of 0.1 M EDTA (pH 7.5), 1 M sorbitol, 0.2 mg/ml of Zymolyase 100T (ICN Cat#152270), 15 mM 2-mercaptoethanol. Incubate at 37° C. for 1 hour to form spheroplasts.

(iii) Spin down the spheroplasts and resuspend in 9 ml of 0.1 M Tris HCl (pH 7.5), 50 mM EDTA (pH 7.5).

(iv) Add 1 ml (1/10 final volume) of 10% SDS and mix gently. Incubate at 60° C. for 10 to 20 minutes.

(v) Add 1/3 volume of 4 M potassium acetate and mix gently. Leave on ice for 30 minutes.

(vi) Centrifuge at 2000×g for 30 minutes and transfer the supernatant to a new tube. Add 3 volumes of isopropanol and mix gently. Leave at room temperature for 10 to 20 minutes for precipitation of DNA.

(vii) Centrifuge again at 2000×g for 30 minutes and discard supernatant. Dissolve the pellet in 10 ml of water.

From-this step onwards, care should be taken not to give shear damage to the DNA.

(viii) Extract with phenol twice and with CIAA (chloroform:isoamyl alcohol=24:1) twice followed by ethanol precipitation at room temperature for 10 to 20 minutes.

(ix) Centrifuge at 2000×g for 30 minutes. Rinse the pellet with 70% ethanol and dry up the pellet.

(x) Dissolve with 1 ml of TE.

2 Vector DNA Preparation

Lorist 2 DNA was linearized by digestion with Hind III or Bam HI. Linearized DNA was dephosphorylated by treatment with bacterial alkaline phosphatase. Small aliquots of DNA before and after phosphatase treatment were used for test ligation for phosphatase treatment according to a known method (J. Sambrook et al., supra).

3 Insert DNA Preparation

Analytical experiment of partial digestion of yeast DNA was performed according to standard procedure (J. Sambrook et al., supra) to determine the optimal enzyme concentration and reaction time. Preparation of size-fractionated DNA from the gel was achieved with LGT agarose and β agarase I. This very gentle method resulted in high recovery (>90%) of fractionated DNA without degradation. Scaled up cleavage reaction was done using 5 μg of DNA with optimal enzyme concentration. Digested samples were loaded in a preparative gel of 0.5% LGT agarose (Bio Rad Preparative grade) at about 1 V/cm overnight. Linearized lambda DNA and its Xho I-digests which give 35-kb and 15-kb bands were also loaded as size markers. After visualizing the DNA under ultraviolet transilluminater, a small slice of agarose containing the fraction ranging from 35 kb to 45 kb was cut out. Recovery of the DNA from the gel slice was achieved using p agarase I (NEB) as follows:

(i) Equilibrate the gel block with water for complete removal of gel electrophoresis buffer.

(ii) Transfer the block to a new tube and add 1/9 volume of 10×β agarase I buffer.

(iii) Melt the gel at 68° C. for 10 minutes. Cool to 40° C. and incubate the molten agarose at 40° C. for 1 hour with optimal number of units of β agarase I.

(iv) Adjust the salt concentration of the solution to 0.5 M NaCl for ethanol precipitation. Chill on ice for 10 minutes.

(v) Centrifuge at 15,000×g for 15 minutes to pellet any remaining undigested carbohydrates.

(vi) Transfer the DNA-containing supernatant to a new tube. Precipitate the DNA with 3 volumes of ethanol at -80° C. for 10 minutes.

(vii) Centrifuge at 15,000×g for 15 minutes and remove the supernatant. Rinse the pellet with 70% ethanol and dry up the pellet.

(viii) Resuspend the pellet in appropriate volume of water for subsequent manipulation.

With this method, in average 100 to 300 ng of size-fractionated DNA can be recovered.

4 Ligation, in vitro Packaging and Infection to E. coli

This process was performed according to standard procedure (J. Sambrook et al., supra). By using lambda inn packaging kit (Nippon Gene) and ED8768 host strain, about 10,000 colonies were obtained from 25 ng of ligated DNA.

5 Screening of Cosmid Libraries

Initial screening was carried out using Hind III-partial cosmid libraries. About 10,000 colonies (500 colonies per φ10 cm plate×20) were plated on LB plates containing 50 μg/ml of kanamycin so that single colonies can be picked up after first screening. Colonies were then lifted from the plates with φ8.2 cm detergent-free nitrocellulose membranes (Advantec Toyo Membrane) and subjected to colony hybridization. Three different kinds of probes were used for screening, namely mixture of six V_(H) -family specific probes to isolate V_(H) -containing cosmid clones, YAC vector probes (Tet^(R) gene segment of pBR322, described above) for isolation of insert-terminal cosmid clones, and total human DNA for any remaining cosmid clones. In average, 50 to 100 clones from a YAC clone with approximately 300-kb insert were isolated with the probes.

6 Construction of Cosmid Contigs

DNA from cosmid clones was isolated by the alkaline lysis method by a conventional method (J. Sambrook et al., supra). Purified cosmid DNAs were digested with Eco RI or Hind III and subjected to agarose gel electrophoresis for restriction mapping. Overlaps between clones were easily found by comparing restriction patterns among cosmid clones. Ordered cosmid clones were then cleaved with Eco RI or Hind III and loaded in a 0.7% agarose gel. Southern filters were hybridized with six V_(H) -family specific probes for identification of location and number of V_(H) segments in cosmid clones. Filters were washed three times for 30 minutes under standard conditions (at 50° C. in 1×SSC, 0.1% SDS) followed by stringent conditions (at 65° C. in 0.1×SSC, 0.1% SDS). Location of V_(H) segments were determined by comparison between hybridization pattern of cosmids and their physical maps.

Theoretically, approximately 50 independent cosmid clones (about 7 fold of the whole YAC insert) would be sufficient to cover the whole YAC insert of 300 kb in length. However, the distribution of cosmid clones were uneven and there still remained a few gaps. The clones corresponding to the gaps could not be isolated even after screening of Sau 3AI partial library or chromosomal walking by using the probes isolated from the edge of each contig. Regions not present in the cosmid libraries were subcloned with phage or plasmid vectors by isolation of DNA fragments of required size from YAC DNA as shown in FIG. 4. After the complete physical map was constructed, the present inventors found out that this was not due to the nonrandom distribution of restriction sites within the YAC insert. The presence of some classes of sequences such as palindromic or tandem repeat DNA might make these regions unclonable or under-represented by using cosmid system. The complete physical map of the 0.8-Mb region constructed in this example is shown in FIG. 1 as mentioned above. The distance from J_(H) of each V_(H) segment shown in FIG. 1 and the sizes of Eco RI and Hind III fragments are shown in Table 3.

Example 2 Construction of Cosmid Clones

A cosmid library was constructed from human high molecular DNAs as follows:

3-31: High molecular DNAs obtained from human placenta were partially digested with Taq I and the resultant was subjected to electrophoresis on 0.5% agarose gel. The 35-45-kb bands were recovered by using DEAE paper. The recovered DNAs were treated with alkaline phosphatase and the resultant was ligated to cosmid vector pJB8 which had been completely digested with a restriction enzyme Cla I. The ligation product was subjected to in vitro packaging and the resultant was infected to host E. coli 490A, followed by the screening by the conventional colony hybridization to obtain the clone.

M131, M84 and M118: These fragments were obtained by the same method as for 3-31 except that the DNA used was human pro B cell line FLEB14-14, the vector and the host E. coli used were Lorist 2 and ED8767, respectively, the combination of restriction enzymes employed was Xba I and Hind III, and the edges of the fragments were modified by the partial repairing. The partial repairing was carried out according to a known method (J. Sambrook, E. F. Fritsch and T. Maniatis, 1989, Molecular Cloning; a Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.).

Example 3 Sequencing Analysis of V_(H) Segments

Instead of sequencing subcloned V_(H) -containing DNA fragments using vector primers, V_(H) family-specific oligonucleotide primers were synthesized. As mentioned above, nucleotide sequences of FR regions of V_(H) segments are highly conserved within the same family, so the present inventors selected consensus sequences from the conserved portions and synthesized family-specific oligonucleotide primers for sequence analysis. For this purpose, automated fluorescence-based sequencing system Model 373A developed by Applied Biosystems was employed. Dye-Deoxy terminator sequencing kit supplied from the same company using fluorescent-dye labeled dideoxy nucleotides was suitable for our purpose since synthesized V_(H) -specific primers could be directly used without fluorescence-label.

(1) Subcloning of V_(H) -containing Restriction Fragments

In order to use V_(H) -family specific primers for sequencing, it is essential to subclone V_(H) -containing DNA fragments so that each plasmid contains only one V_(H) segment. Several other 6bp-site enzymes than Eco RI and Hind III were used to isolate single V_(H) -carrying DNA fragments. Plasmid DNA of the subcloned fragments was isolated by alkaline lysis method followed by ultracentrifugation twice to obtain high quality DNA samples for accurate sequences.

(2) Oligonucleotide Synthesis for Sequencing

To select consensus sequences for V_(H) family-specific oligonucleotide primer synthesis, nucleotide sequences of framework regions and exon-intron boundaries of the known V_(H) segments were aligned by family. Attention was paid so that 3'-half of them have 100% identities to reference sequences and 3'-most nucleotide corresponds to the first or the second letter of highly conserved/invariant amino acid residues. Nineteen additional primers were designed for five V_(H) families as shown in Table 1 (described below).

(3) Sequencing Reaction and Gel Electrophoresis

The sequencing reaction was performed by PCR using Dye-Deoxy terminator sequencing kit (ABI) according to manufacturer's instruction. Gel electrophoresis and detection of signals were done in the sequencing apparatus according to the users manual of the system. In average, sequences of over 350 bases were obtained from each reaction.

(4) Evaluation of Synthesized V_(H) Family-Specific Primers

The primers F-univ and I-R were first chosen to sequence V_(H-I) segments. As shown in Table 2, they annealed 11 of 12 V_(H-I) segments analyzed. It is to be noted that all of 6 functional V_(H-I) segments could be sequenced with these two primers. Two more primers, I-NF1 and I-NR1 were designed for V1-14P and V1-27P segments. These two primers were also used for some other V_(H) segments to verify their sequences obtained by first two primers (Table 2).

Eight primers were designed and used for sequencing V_(H-III) family segments. The first sequencing reaction of each V_(H) segment was performed with F-univ and III-R primers. They annealed more-than 80% of the V_(H-III) segments analyzed (25/30 for F-univ and 24/30 for III-R) (Table 2). Importantly, again, all the functional V_(H-III) segments with one exception could be sequenced with this combination of primers, suggesting that they would be good for most of V_(H-III) cDNA. Based on the nucleotide sequences obtained from first experiment, six additional primers (III-F3, III-R3, III-F4, III-R4, III-NF1 and III-F2) were designed and appropriate combination among them were used for further analysis. Among these, III-R3 and III-F4 were used to determine the sequence of 5' regulatory region and 3' flanking region, respectively. V3-29P and V3-32P were pseudogenes with extensive divergence in their sequences and thus all but. III-NF1 failed to anneal these two V_(H) segments. Sequences of V3-25P, V3-44P and V3-63P were determined using M13 vector primers from their internal restriction sites.

Five each of synthesized primers were used to determine the sequences of V_(H) segments belonging to V_(H-II), V_(H-IV) and V_(H-V) families. Since V_(H) segments belonging to each of these three families are highly homologous with each other, it was thought that four each of the primers are enough for most of the V_(H) segments belonging to these smaller V_(H) families. In fact, all four V_(H-II) family-specific primers annealed three V_(H-II) segments (V2-5, V2-10P and V2-26). In brief, in total 11 primers (F-univ and I-R for V_(H-I) ; II-R1, II-F2 and II-R:2 for V_(H-II) ; F-univ and III-R for V_(H-III) ; IV-R1, IV-F2 and IV-R2 for V_(H-IV) ; V-R2 and V-R3 for V_(H-V)) would be sufficient for sequencing most of the V_(H) segments belonging to five V_(H) families. The II-F1, III-NF2 and IV-F1 primers contain intron sequences and thus cannot be used for cDNA sequencing.

By this procedure, the DNA sequences of the 64 V_(H) segments were determined and they are shown in Sequence ID Nos. 1-64 as mentioned above. The distance of each V_(H) segment from J_(H) and the sizes of Eco RI and Hind III fragments are summarized in Table 3.

(5) Transcriptional Polarities of V_(H) Segments

The strategy for sequencing V_(H) segments with family-specific primers was not suitable for determination of transcriptional polarities of the V_(H) segments because it did not require restriction map of single V_(H) -containing subcloned fragments. The present inventors could not determine orientations of all the V_(H) segments within this region for that reason. The present inventors found, however, that 8 regions containing 21 V_(H) segments were already isolated in cosmid or phage clones since sequences between corresponding V_(H) segments as well as their restriction maps were identical with each other. As the relative orders of these 21 V_(H) segments within these clones are identical to those in the 0.8-Mb region, it was concluded that the orientation of these 21 V_(H) segments are the same as those of the J_(H) segments.

                                      TABLE 1                                      __________________________________________________________________________     VH family-specific primers used for screening and sequencing                                                              SEQ ID                                FAMILY NAME SEQUENCE (5' to 3') *LOCATION DIRECTION NOS                      __________________________________________________________________________     I, III, V                                                                             F - univ                                                                              AGGTGCAGCTGGTGCAGTCTG                                                                          1-8    forward                                                                              65                                     - I I - R CCAGGGGCCTGTCGCACCCA 36-42 reverse 66                                -  I - N F 1 TGGGGCCTCAGTGAAGGTCTCCTG 14-22 forward 67                         -  I - N R 1 GATCC(A/G)TCCCATCCACTCAAG 45-51 reverse 68/69                     - II II - F 1 TGTCTTCTCCACAGGGGTCTT intron-(-2) forward 70                     -  II - F 2 GGGAAGGCCCTGGAGTGGCT 42-48 forward 71                              -  II - R 1 GTGCAGGTCAGCGTGAGGGT 17-23 reverse 72                              -  II - R 2 TGGTTTTTGGAGGTGTCCTTGG 70-77 reverse 73                            - III III - R CACTCCAGCCCCTTCCCTGGAGC 40-47 reverse 74                         -  III - F 3 GTGAGGTTCAGCTGGTGGAGT (-I)-7 forward 75                           -  III - R 3 AGCTGAACCTCACACTGGAC (-3)-4 reverse 76                            -  III - F 4 AAGGGCCGATTCACCATCT 64-70 Forward 77                              -  III - R 4 TTGTCTCTGGAGATGGTGAA 68-73 reverse 78                             -  III - N F 1 TGAGACTCTCCTGTGCAGCCTCTG 18-26 forward 79                       -  III - N F 2 TCT(T/C)TGTGTTTGCAGGTGT intron-(-3) forward 80/81                                                         - IV IV - F 1 TCTGTTCACAGGGGT                                                CCTGTC intron-(-I) forward 82                                                    -  IV - F 2 TCCGGCAGCCCCCAGGG                                                AA 37-43 forward 83                    -  IV - R 1 GCAGGTGAGGGACAGGGT 17-22 reverse 84                                -  IV - R 2 CAGGGAGAACTGGTTCTTGGA 74-80 reverse 85                             - V V - R 1 CCCGGGCATCTGGCGCACCCA 36-42 reverse 86                             -  V - R 2 GCTGCTCCACTGCAGGTAGGC 78-82R reverse 87                             -  V - R 3 CTTCAGGCTGCTCCACTGCAG 74-83 reverse 88                           __________________________________________________________________________      *Locations of the primers are indicated as amino acid residue number           according to Kabat et al. Bases with redundancy are shown in the               parentheses. Directions relative to coding sequence are also shown.      

                                      TABLE 2                                      __________________________________________________________________________     List of useful primers for sequencing V.sub.H clones                                 V.sub.H-I primers                                                                         V.sub.H-III primers                                                                               V.sub.H-IV primers                         V.sub.H segments                                                                     univ                                                                              R NF1                                                                               NR1                                                                               univ                                                                              R F3                                                                               R3                                                                               F4                                                                               R4                                                                               NF1                                                                               NF2                                                                               F1                                                                               R1                                                                               F2                                                                               R2                                   __________________________________________________________________________     V.sub.H I                                                                        1-2 + +                                                                        1-3 + +                                                                        1-8 + +                                                                        1-12P + +                                                                      1-14P - + +                                                                    1-17P + + + +                                                                  1-18 + +  +                                                                    1-24P + + + +                                                                  1-27P + - + +                                                                  1-40P + +                                                                      1-45 + +                                                                       1-46 + +                                                                       V.sub.H III                                                                    3-6P     - + - + - + + +                                                       3-7     + +  + +                                                               3-9     + +  +                                                                 3-11      +  +   + +                                                           3-13     + - + + + +                                                           3-15     + +  + +                                                              3-16P     + +  +  +                                                            3-19P     + +    +                                                             3-20     + +  + + +                                                            3-21     +  + +  + +                                                           3-22P     + +  +                                                               3-23     + +  + + +                                                            3-29P     - - - - - - + -                                                      3-30     + +  + + +                                                            3-32P     - - - - - - + -                                                      3-33     + +   + +  +                                                          3-35     + +                                                                   3-36P     - + +                                                                3-37P     + -    +                                                             3-38P     + +                                                                  3-41P     + +                                                                  3-42P     + -  +  +                                                            3-43     + +                                                                   3-47P     + +                                                                  3-48     + +                                                                   3-49     + +                                                                   3-50P     - +      +                                                           3-52P     + +                                                                  3-53     + +                                                                   3-54P     + +   +                                                              3-64     + +                                                                   V.sub.H IV                                                                     4-4             + + + +                                                        4-31             + + - +                                                       4-34             + + + +                                                       4-39             + +                                                           4-55P             + +                                                        __________________________________________________________________________

                  TABLE 3                                                          ______________________________________                                         kb from           Fragment size(kb)                                            V.sub.H J.sub.H       EcoRI    Hind III                                        ______________________________________                                         6-1      75           0.9      25                                                1-2 125 7.2 12.5                                                               1-3 150 3.4 1.7                                                                4-4 160 5.1 8.0                                                                2-5 175 5.4 16.0                                                               3-6P 185 11.8 16.0                                                             3-7 190 2.2 5.0                                                                1-8 215 3.8 2.0                                                                3-9 230 2.6 5.4                                                                2-10P 235 13.5 18.5                                                            3-11 245 1.6 18.5                                                              1-12P 250 4.5 2.8                                                              3-13 260 1.7 5.8                                                               1-14P 275 2.9 13.0                                                             3-15 280 4.8 13.0                                                              3-16P 290 5.4 1.8                                                              1-17P 295 5.4 + 1.6 10.2                                                       1-18 315 3.4 8.8                                                               3-19P 330 4.3 14.7                                                             3-20 345 11.8 12.8                                                             3-21 360 2.2 6.8                                                               3-22P 385 5.7 7.0                                                              3-23 395 2.0 5.7                                                               1-24P 410 3.0 5.2                                                              3-25P 420 10.0 7.3                                                             2-26 430 8.1 6.6                                                               1-27P 450 8.3 11.3                                                             4-28 455 8.3 5.4                                                               3-29P 460 3.5 5.8                                                              3-30 470 9.8 6.8                                                               4-31 475 10.3 13.0                                                             3-32P 485 13.3 5.6                                                             3-33 490 13.3 6.8                                                              4-34 505 11.5 16.2                                                             3-35 520 5.3 3.2                                                               3-36P 525 5.3 5.7                                                              3-37P 540 7.5 13.2                                                             3-38P 545 8.0 15.4                                                             4-39 555 7.0 15.4                                                              1-40P 560 1.4 3.2                                                              3-41P 580 4.4 11.9                                                             3-42P 590 3.0 3.8                                                              3-43 600 6.5 8.1                                                               3-44P 610 8.8 17.0                                                             1-45 635 10.7 2.7                                                              1-46 640 2.0 4.6                                                               3-47P 650 2.7 10.5                                                             3-48 670 2.7 3.9                                                               3-49 690 1.6 16.5                                                              3-50P 695 10.0 16.5                                                            5-51 710 8.0 11.0                                                              3-52P 715 4.0 11.0                                                             3-53 725 8.3 6.3                                                               3-54P 730 6.4 15.4                                                             4-55P 735 3.9 15.4                                                             1-56P 740 3.4 15.4                                                             3-57P 745 9.7 6.6                                                              1-58P 750 8.3 17.5                                                             4-59 755 8.3 17.5                                                              3-60P 760 0.8 + 3.0 17.5                                                       4-61 770 8.1 9.0                                                               3-62P 775 4.6 9.0                                                              3-63P 780 8.9 6.2                                                              3-64 790 4.4 >7.4                                                            ______________________________________                                    

    __________________________________________________________________________     #             SEQUENCE LISTING                                                    - -  - - (1) GENERAL INFORMATION:                                              - -    (iii) NUMBER OF SEQUENCES: 145                                          - -  - - (2) INFORMATION FOR SEQ ID NO:1:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1429 base - #pairs                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                - - CTGATCTATG AATAAGGGTA TATAGACCAG TTTGGCCTGA TGTAGGGAAC GC -             #CAAAGTGC     60                                                                  - - TGGAATTTCA GAGTCATCAC ACCCAGGGGC CCTGCCTCTG AGCTCCTCTT TG -             #CATCCAAT    120                                                                  - - CTGCTGAAGA ACATGGCTCT AGGGAAACCC AGTTGTAGAC CTGAGGGCCC CG -             #GCTCTTCA    180                                                                  - - ATGAGCCATC TCCGTCCCGG GGCCTTATAT CAGCAAGTGA CGCACACAGG CA -             #AATGCCAG    240                                                                  - - GGTGTGGTTT CCTGTTTAAA TGTAGCCTCC CCCGCTGCAG AACTGCAGAG CC -             #TGCTGAAT    300                                                                  - - TCTGGCTGAC CAGGGCAGTC ACCAGAGCTC CAGACAATGT CTGTCTCCTT CC -             #TCATCTTC    360                                                                  - - CTGCCCGTGC TGGGCCTCCC ATGGGGTCAG TGTCAGGGAG ATGCCGTATT CA -             #CAGCAGCA    420                                                                  - - TTCACAGACT GAGGGGTGTT TCACTTTGCT GTTTCCTTTT GTCTCCAGGT GT -             #CCTGTCAC    480                                                                  - - AGGTACAGCT GCAGCAGTCA GGTCCAGGAC TGGTGAAGCC CTCGCAGACC CT -             #CTCACTCA    540                                                                  - - CCTGTGCCAT CTCCGGGGAC AGTGTCTCTA GCAACAGTGC TGCTTGGAAC TG -             #GATCAGGC    600                                                                  - - AGTCCCCATC GAGAGGCCTT GAGTGGCTGG GAAGGACATA CTACAGGTCC AA -             #GTGGTATA    660                                                                  - - ATGATTATGC AGTATCTGTG AAAAGTCGAA TAACCATCAA CCCAGACACA TC -             #CAAGAACC    720                                                                  - - AGTTCTCCCT GCAGCTGAAC TCTGTGACTC CCGAGGACAC GGCTGTGTAT TA -             #CTGTGCAA    780                                                                  - - GAGACACAGT GAGGGGAAGT CAGTGTGAGC CCAGACACAA ACCTCCCTGC AG -             #GGATGCTC    840                                                                  - - AGGACCCCAG AAGGCACCCA GCACTACCAG CGCAGGGCCC AGACCAGGAG CA -             #GGTGTGGA    900                                                                  - - GTTAAGCCAA AATGGAACTT CTTGCTGTGT CTTAAACTGT TGTTGTTTTT TT -             #TTTTTTTT    960                                                                  - - TGGCTCAGCA ACAGAGATCA TAGAAAACCC TTTTTCATAT TTTTCAAATC TG -             #TTCTTAGT   1020                                                                  - - CTAATGGAGA TTCTCTAATA TGTGACATTG TTTTTCTCTT GCTTGTTTTT GG -             #AATTCTTT   1080                                                                  - - GTCTTTGACT TTTGACAACT TGACTTTTGA CAGTGTGCCT CAAAGAAGTT CT -             #ATTTTGGG   1140                                                                  - - TTCTGTGAAC CTCCTGGATC TGGGAAGTTT TCAGCTATGA TTTCATTAAA CG -             #TGTTTTCT   1200                                                                  - - ACACCATTTC CCTACTTCTT TCCAATACCC ATAATGCAAA TATTTGTTCA CT -             #TAATTGTG   1260                                                                  - - TCCCATAAAT GCCTGGGGAT TTTCTTCATT CCTTTTTACT CTTTTTTTCT TT -             #TTATTCAT   1320                                                                  - - CTGCCTGAAT TATTTCAAAA GATCTGTCTT CAACTTCAGA AACTCTTTGG CT -             #TGGCCTAG   1380                                                                  - - TCTAATCTTG AAGGTCTCAA TTGTACTTTT AATTTCATTC ATTGAATTC  - #                  1429                                                                         - -  - - (2) INFORMATION FOR SEQ ID NO:2:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 512 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                - - TGAGAGCTCC GTTCCTCACC ATGGACTGGA CCTGGAGGAT CCTCTTCTTG GT -              #GGCAGCAG     60                                                                  - - CCACAGGTAA GAGGCTCCCT AGTCCCAGTG ATGAGAAAGA GATTGAGTCC AG -             #TCCAGGGA    120                                                                  - - GATCTCATCC ACTTCTGTGT TCTCTCCACA GGAGCCCACT CCCAGGTGCA GC -             #TGGTGCAG    180                                                                  - - TCTGGGGCTG AGGTGAAGAA GCCTGGGGCC TCAGTGAAGG TCTCCTGCAA GG -             #CTTCTGGA    240                                                                  - - TACACCTTCA CCGGCTACTA TATGCACTGG GTGCGACAGG CCCCTGGACA AG -             #GGCTTGAG    300                                                                  - - TGGATGGGAT GGATCAACCC TAACAGTGGT GGCACAAACT ATGCACAGAA GT -             #TTCAGGGC    360                                                                  - - AGGGTCACCA TGACCAGGGA CACGTCCATC AGCACAGCCT ACATGGAGCT GA -             #GCAGGCTG    420                                                                  - - AGATCTGACG ACACGGCCGT GTATTACTGT GCGAGAGACA CAGTGTGAAA AC -             #CCACATCC    480                                                                  - - TGAGGGTGTC AGAAACCCAA GGGAGGAGGC AG       - #                  - #              512                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:3:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 496 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                - - CACAACTCCT CACCATGGAC TGGACCTGGA GGATCCTCTT TTTGGTGGCA GC -              #AGCCACAG     60                                                                  - - GTAAGGGGCT GCCAAATCCC AGTGAGGAGG AAGGGACTGA AGCCAGTCAA GG -             #GGGCTTCC    120                                                                  - - ATCCACTCCT GTGTCTTCTC TACAGGTGTC CACTCCCAGG TTCAGCTGGT GC -             #AGTCTGGG    180                                                                  - - GCTGAGGTGA AGAAGCCTGG GGCCTCAGTG AAGGTTTCCT GCAAGGCTTC TG -             #GATACACC    240                                                                  - - TTCACTAGCT ATGCTATGCA TTGGGTGCGC CAGGCCCCCG GACAAAGGCT TG -             #AGTGGATG    300                                                                  - - GGATGGAGCA ACGCTGGCAA TGGTAACACA AAATATTCAC AGGAGTTCCA GG -             #GCAGAGTC    360                                                                  - - ACCATTACCA GGGACACATC CGCGAGCACA GCCTACATGG AGCTGAGCAG CC -             #TGAGATCT    420                                                                  - - GAGGACATGG CTGTGTATTA CTGTGCGAGA GACACAGTGT GAAAACCCAC AT -             #CCTGAGAG    480                                                                  - - TGTCAGAAAC CCCAGG             - #                  - #                       - #   496                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:4:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 650 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                - - CACAGGAAAC CACCACACAT TTCCTTAAAT TCAGGGTCCA GCTCACATGG GA -              #AATACTTT     60                                                                  - - CTGAGACTCA TGGACCTCCT GCACAAGAAC ATGAAACACC TGTGGTTCTT CC -             #TCCTGCTG    120                                                                  - - GTGGCAGCTC CCAGATGTGA GTGTCTCAAG GCTGCAGACA TGGGATATGG GA -             #GGTGCCTC    180                                                                  - - TGATCCCAGG GCTCACTGTG GGTCTCTCTG TTCACAGGGG TCCTGTCCCA GG -             #TGCAGCTG    240                                                                  - - CAGGAGTCGG GCCCAGGACT GGTGAAGCCT TCGGAGACCC TGTCCCTCAC CT -             #GCACTGTC    300                                                                  - - TCTGGTGGCT CCATCAGTAG TTACTACTGG AGCTGGATCC GGCAGCCCGC CG -             #GGAAGGGA    360                                                                  - - CTGGAGTGGA TTGGGCGTAT CTATACCAGT GGGAGCACCA ACTACAACCC CT -             #CCCTCAAG    420                                                                  - - AGTCGAGTCA CCATGTCAGT AGACACGTCC AAGAACCAGT TCTCCCTGAA GC -             #TGAGCTCT    480                                                                  - - GTGACCGCCG CGGACACGGC CGTGTATTAC TGTGCGAGAG ACACAGTGAG GG -             #GAGGTGAG    540                                                                  - - TGTGAGCCCA GACACAAACC TCCCTGCAGG GAGGCGGAGG GGACCGGCGC AG -             #GTGCTGCT    600                                                                  - - CAAGACCAGC AGGGGGCGCG CGGGGCCCAC AGAGCAAGAG GCCGGGTCAG  - #                  650                                                                         - -  - - (2) INFORMATION FOR SEQ ID NO:5:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 613 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                - - CCAGCTCCAC CCTCCTCTGG GTTGAAAAAG CCGAGCACAG GTACCAGCTC AG -              #TGACTCCT     60                                                                  - - GTGCACCACC ATGGACACAC TTTGCTCCAC GCTCCTGCTG CTGACCATCC CT -             #TCATGTGA    120                                                                  - - GTGCTGTGGT CAGGGACTCC TTCACGGGTG AAACATCAGT TTTCTTGTTT GT -             #GGGCTTCA    180                                                                  - - TCTTCTTATG CTTTCTCCAC AGGGGTCTTG TCCCAGATCA CCTTGAAGGA GT -             #CTGGTCCT    240                                                                  - - ACGCTGGTGA AACCCACACA GACCCTCACG CTGACCTGCA CCTTCTCTGG GT -             #TCTCACTC    300                                                                  - - AGCACTAGTG GAGTGGGTGT GGGCTGGATC CGTCAGCCCC CAGGAAAGGC CC -             #TGGAGTGG    360                                                                  - - CTTGCACTCA TTTATTGGAA TGATGATAAG CGCTACAGCC CATCTCTGAA GA -             #GCAGGCTC    420                                                                  - - ACCATCACCA AGGACACCTC CAAAAACCAG GTGGTCCTTA CAATGACCAA CA -             #TGGACCCT    480                                                                  - - GTGGACACAG CCACATATTA CTGTGCACAC AGACCACAAA GACACAGCCC AG -             #GGCACCTC    540                                                                  - - CTGTACAAAA ACCCAGGCTG CTTCTCATTG GTGCTCCCTC CCCACCTCTG CA -             #GAACAGGA    600                                                                  - - AAGTCTGTCT GCT              - #                  - #                       - #     613                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:6:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 594 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                - - ACAGGATTCA CCATGGAGTT GGGGCTGAGG TGGGTTTTCC TTGCTGCTAT TT -              #TAAAAGGT     60                                                                  - - GATTTATGGT TAACTAGAGC TATTGAGTGT GAATGGACAT AAGTGAGCGA AA -             #CAGTGGAT    120                                                                  - - ATGTGTGGCA GTTTCTTACC AGGATGTCTC TGTGTTTGCA GGTGTCCAGT GT -             #GAGATGCA    180                                                                  - - GCTGGTAGAG TCTGGAGCAA ACTTGACAAA GCCTGGGTGT CCCTGAGACT CT -             #CCTGTGCA    240                                                                  - - GCCTCTGGAT TCACCTTCAG TAGCCATAGC ACGCACTGGG TCCCCCAGGC TC -             #CAGGGAAG    300                                                                  - - GGTCTGCAGT GGGTCCCAGT TATTAGTGGT AGTGGTAGTA CCATGTACTA CG -             #CAGACTCT    360                                                                  - - GTGAAGGGCC GATTCACCAT TTCCAGAGAC AATACCAAAA ACTCACTGTA TC -             #TGCAAATG    420                                                                  - - AACAGACTGA GGGCAGAGGA TGCAGCTGCA TATGACTCTG TGAGAGATAC GG -             #TAAGGAGA    480                                                                  - - AGTCAGTGTG AGCCCAGACA CAAACCTCCC TTCAGGGTAC CTGGGACAAC CA -             #GGGAAAGC    540                                                                  - - CTGGGACACT GTGCACTGTG CTGACCCCAG GGGCAAGTGC AGGTGCTACA AG - #GG               594                                                                        - -  - - (2) INFORMATION FOR SEQ ID NO:7:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 877 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                - - ACAGCCTATT CCTCCAGCAT CCCACTAGAG CTTCTTATAT AGTAGGAGAC AT -              #GCAAATAG     60                                                                  - - GGCCCTCCCT CTACTGATGA AAACCAACCC AACCCTGACC CTGCAGGTCT CA -             #GAGAGGAG    120                                                                  - - CCTTAGCCCT GGACTCCAAG GCCTTTCCAC TTGGTGATCA GCACTGAGCA CA -             #GAGGACTC    180                                                                  - - ACCATGGAAT TGGGGCTGAG CTGGGTTTTC CTTGTTGCTA TTTTAGAAGG TG -             #ATTCATGG    240                                                                  - - AAAACTAGGA AGATTGAGTG TGTGTGGATA TGAGTGTGAG AAACAGTGGA TT -             #TGTGTGGC    300                                                                  - - AGTTTCTGAC CTTGGTGTCT CTTTGTTTGC AGGTGTCCAG TGTGAGGTGC AG -             #CTGGTGGA    360                                                                  - - GTCTGGGGGA GGCTTGGTCC AGCCTGGGGG GTCCCTGAGA CTCTCCTGTG CA -             #GCCTCTGG    420                                                                  - - ATTCACCTTT AGTAGCTATT GGATGAGCTG GGTCCGCCAG GCTCCAGGGA AG -             #GGGCTGGA    480                                                                  - - GTGGGTGGCC AACATAAAGC AAGATGGAAG TGAGAAATAC TATGTGGACT CT -             #GTGAAGGG    540                                                                  - - CCGATTCACC ATCTCCAGAG ACAACGCCAA GAACTCACTG TATCTGCAAA TG -             #AACAGCCT    600                                                                  - - GAGAGCCGAG GACACGGCTG TGTATTACTG TGCGAGAGAC ACAGTGAGGG GA -             #AGTCAGTG    660                                                                  - - TGAGCCCAGA CACAAACCTC CCTGCAGGGG TCCCTTGGGA CCACCAGGGG GC -             #GACAGGGC    720                                                                  - - ATTGAGCACT GGGCTGTCTC CAGGGCAGGT GCAGGTGCTG CTGAGGGCTG GC -             #TTCCTGTC    780                                                                  - - GCGGTCTGGG GCTGCCTCGT CGTCAAATTT CCCCAGGAAC TTCTCCAGAT TT -             #ACAATTCT    840                                                                  - - GTACTGACAT TTCATGTCTC TAAATGCAAT ACTTTTT      - #                       - #     877                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:8:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 564 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                - - CACTCCACCA ACCACATCTG TCCTCTAGAG AAAACCCTGT GAGCACACCT CC -              #TCACCATG     60                                                                  - - GACTGGACCT GGAGGATCCT CTTCTTGGTG GCAGCAGCTA CAAGTAAGGG GC -             #TTCCTAGT    120                                                                  - - CTCAAAGCTG AGGAACGGAT CCTGGTTCAG TCAAAGAGGA TTTTATTCTC TC -             #CTGTGTTC    180                                                                  - - TCTCCACAGG TGCCCACTCC CAGGTGCAGC TGGTGCAGTC TGGGGCTGAG GT -             #GAAGAAGC    240                                                                  - - CTGGGGCCTC AGTGAAGGTC TCCTGCAAGG CTTCTGGATA CACCTTCACC AG -             #TTATGATA    300                                                                  - - TCAACTGGGT GCGACAGGCC ACTGGACAAG GGCTTGAGTG GATGGGATGG AT -             #GAACCCTA    360                                                                  - - ACAGTGGTAA CACAGGCTAT GCACAGAAGT TCCAGGGCAG AGTCACCATG AC -             #CAGGAACA    420                                                                  - - CCTCCATAAG CACAGCCTAC ATGGAGCTGA GCAGCCTGAG ATCTGAGGAC AC -             #GGCCGTGT    480                                                                  - - ATTACTGTGC GAGAGGCACA GTGTGAAAAA CCACATCCTC AGAGAGTCAG AA -             #ACCCCTAG    540                                                                  - - GGGAGAAGGC AGCTTCTGCT GGGC          - #                  - #                    564                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:9:                                      - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 640 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                - - CAAATAGGGC CCTCCCTCTG CTGATGAAAA CCAGCCCAGC CCTGACCCTG CA -              #GCTCTGGG     60                                                                  - - AGAGGAGCCC CAGCCCTGAG ATTCCCAGGT GTTTCCATTC AGTGATCAGC AC -             #TGAACACA    120                                                                  - - GAGGACTCAC CATGGAGTTG GGACTGAGCT GGATTTTCCT TTTGGCTATT TT -             #AAAAGGTG    180                                                                  - - ATTCATGGAG AAATAGAGAG ATTGAGTGTG AGTGGACATG AGTGGATTTG TG -             #TGGCAGTT    240                                                                  - - TCTGACCTTG GTGTCTCTGT GTTTGCAGGT GTCCAGTGTG AAGTGCAGCT GG -             #TGGAGTCT    300                                                                  - - GGGGGAGGCT TGGTACAGCC TGGCAGGTCC CTGAGACTCT CCTGTGCAGC CT -             #CTGGATTC    360                                                                  - - ACCTTTGATG ATTATGCCAT GCACTGGGTC CGGCAAGCTC CAGGGAAGGG CC -             #TGGAGTGG    420                                                                  - - GTCTCAGGTA TTAGTTGGAA TAGTGGTAGC ATAGGCTATG CGGACTCTGT GA -             #AGGGCCGA    480                                                                  - - TTCACCATCT CCAGAGACAA CGCCAAGAAC TCCCTGTATC TGCAAATGAA CA -             #GTCTGAGA    540                                                                  - - GCTGAGGACA CGGCCTTGTA TTACTGTGCA AAAGATACAC AGTGAGGGGA AG -             #TCAGCGAG    600                                                                  - - AGCCCAGACA AAAACCTCCT GCAGGAAGAC AGGAGGGGCC     - #                       - #   640                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:10:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 630 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                               - - AGCTCCACCC TTCTCTGTGT TGAAAAGCCG AGCATGGGGA CCTAGTTCAG TG -              #ACTCCTGC     60                                                                  - - GCCCCACCAC ATGGAGCTTT ACTCCACGCT TCTCCTGCTG ACTGTCCCTT CC -             #TGTGAGTT    120                                                                  - - CAGTGGTCAG GGAATCCTTC AGGGGTGAAA CACCTGTTCT TTTCTTTGTG GG -             #CTTCATCT    180                                                                  - - TCTTATGCTT TCTCCACAGG GGTCTTATCC CAGGTCACCT TGAAGGAGTC TG -             #GTCCTGCA    240                                                                  - - CTGGTGAAAC CCACACAGAC CCTCATGCTG ACCTGCACCT TCTCTGGGTT CT -             #CACTCAGC    300                                                                  - - ACTTCTGGAA TGGGTGTGGG TTAGATCTGT CAGCCCTCAG CAAAGGCCCT GG -             #AGTGGCTT    360                                                                  - - GCACACATTT ATTAGAATGA TAATAAATAC TACAGCCCAT CTCTGAAGAG TA -             #GGCTCATT    420                                                                  - - ATCTCCAAGG ACACCTCCAA GAATGAAGTG GTTCTAACAG TGATCAACAT GG -             #ACATTGTG    480                                                                  - - GACACAGCCA CACATTACTG TGCAAGGAGA CCACAGAGAC AGAGCCCAGG GT -             #GCCTCTTG    540                                                                  - - TACAAGACCC AGGCTGCTTC TCAGTGGCGC TCCCTCCCCA CCTCTGCAGA AC -             #AGGAAAGT    600                                                                  - - GTGGCTGAGA TGCCATTTCC TGTCAGGGTC         - #                  - #               630                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:11:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 715 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                               - - CACCCCAGGC TTTACACTTT ATGCTTCCGG CTCGTATGTT GTGTGGAATT GT -              #GAGCGGAT     60                                                                  - - AACAATTTCA CACAGGAAAC AGCTATGACC ATGATTACGC CAAGCTTGCA TG -             #CCTGCAGG    120                                                                  - - TCGACTCTAG AGGATCCCCG GGTACCGAGC TCGAATTCCC AGGAGTTTCC AT -             #TCGGTGAT    180                                                                  - - CAGCACTGAA CACAGAGGAC TCACCATGGA GTTTGGGCTG AGCTGGGTTT TC -             #CTTGTTGC    240                                                                  - - TATAATAAAA GGTGATTTAT GGAGAACTAG AGACATTGAG TGGACGTGAG TG -             #AGATAAGC    300                                                                  - - AGTGAATATA TGTGGCAGTT TCTGACTAGG TTGTCTCTGT GTTTGCAGGT GT -             #CCAGTGTC    360                                                                  - - AGGTGCAGCT GGTGGAGTCT GGGGGAGGCT TGGTCAAGCC TGGAGGGTCC CT -             #GAGACTCT    420                                                                  - - CCTGTGCAGC CTCTGGATTC ACCTTCAGTG ACTACTACAT GAGCTGGATC CG -             #CCAGGCTC    480                                                                  - - CAGGGAAGGG GCTGGAGTGG GTTTCATACA TTAGTAGTAG TGGTAGTACC AT -             #ATACTACG    540                                                                  - - CAGACTCTGT GAAGGGCCGA TTCACCATCT CCAGGGACAA CGCCAAGAAC TC -             #ACTGTATC    600                                                                  - - TGCAAATGAA CAGCCTGAGA GCCGAGGACA CGGCCGTGTA TTACTGTGCG AG -             #AGACACAG    660                                                                  - - TGAGGGGAAG TCAGTGTGAG CCCAGACACA AACCTCCCTG CAGGGGGTCC CT - #TGG              715                                                                        - -  - - (2) INFORMATION FOR SEQ ID NO:12:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 660 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                               - - GGATTGGGCT TTGAGCTAAG GANAGGCTTT GTCNNATGAA TATNCGAATA TA -              #CTGATATC     60                                                                  - - CACTGAGNTG AATATGTTCT GTNCCCTGAG AGAATCACCT GAGAGAATCC CC -             #TGAGAGCA    120                                                                  - - CATCTCCTCA TGGNCTGGAC CTACAAGATC CTCTTCTTGG TGGCAGCAGC CA -             #CAGGTAAG    180                                                                  - - CAGTTCCCAG GTCCAAGTAA TGAGGAGGGG ATTGAGTCCA GTCAAGGGGG CT -             #TTCATCCA    240                                                                  - - CTCCTGTGTC CTCCCCACAG GTGCCCACTC CCAGGTGCAG CTGGTGCAAT CT -             #GGGGCTGA    300                                                                  - - GGTGAAGAAG CCTGGGGCCT CAGTGAAGGT CTCCTGCAAG GCTTCTGGAT AC -             #ACCTTCAC    360                                                                  - - CTACTGCTAC TTGCACTGGG TATGACAGGC CCCTGGACAA GGGCTTGAAT GG -             #ACAGGATT    420                                                                  - - TTAGTTATTT GAGAGATTTT TCATACAACA TTTATTCTGT AAGCAAATTT CA -             #GGGATTGT    480                                                                  - - AGAATGAATC ATATTAACAA ATCTGACACA GAACTTCCTC TGAATCAATC TT -             #TGTAAACA    540                                                                  - - TCAATTTCTG AATCAATGTT GTNAATATTT CAGAACACAA GCACAANTTC AC -             #ATTTNAAC    600                                                                  - - TCTACTTTNA TCTCTATTTA AAANATATCA AAAANTCTCA TCNNGTGCAT GT -             #AACGTTTG    660                                                                  - -  - - (2) INFORMATION FOR SEQ ID NO:13:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 819 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                               - - AATAAAAAAA TGATAGTTGT TAAATGTTTA TCGCAGAACA ATTCCAAATA AG -             #GCAGCATT     60                                                                  - - TTCCCCAAAT ACAATCATTG TCATCCAAAA TCCCCCAGGA CGCTCTCATC TA -             #CTCTGCCC    120                                                                  - - CTGCCTTCAC CTCAGATGTC CCACCCCAGA GCTTGCTATA TAGTAACAGA CA -             #TGCAAATA    180                                                                  - - GTTGACTCCC TCTCCTGATG AAAACCAGCC CAGCCCTGAC CCTGCAGCTC TG -             #GGAGTGGA    240                                                                  - - GCCCCAGCCT TGGGATTCCC AAGTGTTTGT ATTCAGTGAT CAGGACTGAA CA -             #CACAGGAC    300                                                                  - - TCACCATGGA GTTGGGGCTG AGCTGGGTTT TCCTTGTTGC TATATTAGAA GG -             #TGATTCAT    360                                                                  - - GGAGAACTAG AGATATTGAG TGTGAATGGG CATGAATGAG AGAAACAGTG GG -             #TATGTGTG    420                                                                  - - GCAATTTCTG ACTTTTGTGT CTCTGTGCCT TGCAGGTGTC CAGTGTGAGG TG -             #CATCTGGT    480                                                                  - - GGAGTCTGGG GGAGGCTTGG TACAGCCTGG GGGGGCCCTG AGACTCTCCT GT -             #GCAGCCTC    540                                                                  - - TGGATTCACC TTCAGTAACT ACGACATGCA CTGGGTCCGC CAAGCTACAG GA -             #AAAGGTCT    600                                                                  - - GGAGTGGGTC TCAGCCAATG GTACTGCTGG TGACACATAC TATCCAGGCT CC -             #GTGAAGGG    660                                                                  - - GCGATTCACC ATCTCCAGAG AAAATGCCAA GAACTCCTTG TATCTTCAAA TG -             #AACAGCCT    720                                                                  - - GAGAGCCGGG GACACGGCTG TGTATTACTG TGCAAGAGAC ACAGTGAGGG GA -             #AGTCAGTA    780                                                                  - - TGAGCCCAGA CACAAACCTC CCTGCAGAAT GCCTGGGGG      - #                       - #   819                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:14:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 816 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                               - - AGNGANGAAG GNAGTGATCA CTGTGATCTT TTCNCCAAGT TCACCATTTC NC -              #TGAAGGTG     60                                                                  - - AGCACAGGTC CTCCTGCATG TGTTCAAACA AAAGNNNNAG AGACTACCTG GT -             #AAGTGAGG    120                                                                  - - TGCTCACCTG GTTCTGGATG TTTGGTCTGT CTCCTCCCCT CTGTTGCCCC AC -             #ACAAGGTC    180                                                                  - - AGCCCACTCT TTCCAGGTCC GAAGAAGAGA GCACAGGTTT GTCCTGATTA TA -             #TGACTCAC    240                                                                  - - CCAGCTTCTG ATGACTCTCC TGTTGCCAGC GTCCATGGCC TCAGTGAAGG TC -             #TCCTGCAA    300                                                                  - - AGCTCTGGAT ACACCTTCGC CAGCTACGAC ATTCACTGTG TGTGACAGGC CC -             #CTGGATAA    360                                                                  - - GGGTTTGANT GGATGGTAGG GAGCTACTCT GGCAATGGTA ACACAGGCTA TG -             #CACAGAAG    420                                                                  - - TTTCAGGGCA GAGTCACCAT GACCAGGGAC ACGTCCACGA GCACAGCCTA CA -             #TGGAGCTG    480                                                                  - - AGCAGTCAGA GATCTGAGGA CATAGATGTG TACTACTGTG CGANACACAC AG -             #TGTGACAN    540                                                                  - - CCCACATCCT GAGAGAGTCA GAAATCCTGA GGGAGGTGGC AGCAGTGCTA GG -             #CTTGAGAG    600                                                                  - - ATGACAGGGA TTTTATTTGC TTTNNCGGCT TTTTTTNGNN AGCGAGGTTA NT -             #TCATTACA    660                                                                  - - GANNNNNGGA AAATAGAAAT GTGTATGGAC TCTAATTATG TGGGAAATTT CC -             #ATACAACT    720                                                                  - - TTGGTTCTCT TNGNNNNTTC AGGGGTNGGA NNCAATCAAT TAATAACCTG AT -             #AAAGATTC    780                                                                  - - GAGTCGTACC CNGGATCCCT GNTTCGCCTG AGNATA      - #                        - #      816                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:15:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 535 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                               - - CACAGAGGAC TCACCATGGA GTTTGGGCTG AGCTGGATTT TCCTTCCTGC TA -             #TTTTAAAA     60                                                                  - - GGTGATTTAT GGAGAACTAG AGAGATTAAG TGTGAGTGGA CGTGAGTGAG AG -             #AAACAGTG    120                                                                  - - GATATGTGTG GCAGTTTCTG ATCTTAGTGT CTCTGTGTTT GCAGGTGTCC AG -             #TGTGAGGT    180                                                                  - - GCAGCTGGTG GAGTCTGGGG GAGCCTTGGT AAAGCCTGGG GGGTCCCTTA GA -             #CTCTCCTG    240                                                                  - - TGCAGCCTCT GGATTCACTT TCAGTAACGC CTGGATGAGC TGGGTCCGCC AG -             #GCTCCAGG    300                                                                  - - GAAGGGGCTG GAGTGGGTTG GCCGTATTAA AAGCAAAACT GATGGTGGGA CA -             #ACAGACTA    360                                                                  - - CGCTGCACCC GTGAAAGGCA GATTCACCAT CTCAAGAGAT GATTCAAAAA AC -             #ACGCTGTA    420                                                                  - - TCTGCAAATG AACAGCCTGA AAACCGAGGA CACAGCCGTG TATTACTGTA CC -             #ACAGACAC    480                                                                  - - AGTGAGGGGA GGTCAGTGTG AGCCCGGACA CAAACCTCCC TGCAGGGGCG CG - #CGG              535                                                                        - -  - - (2) INFORMATION FOR SEQ ID NO:16:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 542 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                               - - ATTGGGTCAA CAGCAATAAA CAAATTACCA TGGAATTTGG GCTGAGCTGG GT -              #TTTTCTTG     60                                                                  - - CTGGTATTTT AAAAGGTGAT TCATGGAGAA CTAAGGATAT TGAGTGAGTG GA -             #CATGAGTG    120                                                                  - - AGAGAAACAG TGGATATGTG TGGCAGTTTC TGACCAGGGT GTCTCTGTGT TT -             #GCAGGTGT    180                                                                  - - CCAGTGTGAG GTACAACTGG TGGAGTCTGG GGGAGGCTTG GTACAGCCTG GG -             #GGGTCCCT    240                                                                  - - GAGACTCTCC TGTGCAGCCT CTGGATTCAC CTTCAGTAAC AGTGACATGA AC -             #TGGGCCCG    300                                                                  - - CAAGGCTCCA GGAAAGGGGC TGGAGTGGGT ATCGGGTGTT AGTTGGAATG GC -             #AGTAGGAC    360                                                                  - - GCACTATGTG GACTCCGTGA AGCGCCGATT CATCATCTCC AGAGACAATT CC -             #AGGAACTC    420                                                                  - - CCTGTATCTG CAAAAGAACA GACGGAGAGC CGAGGACATG GCTGTGTATT AC -             #TGTGTGAG    480                                                                  - - AAATCCTGTG AGGGGACACA AGTGCGAGCC CAGACACAAA CCTCCTGCAG GA -             #ACACTGGG    540                                                                  - - CG                  - #                  - #                  - #                  542                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:17:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 591 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                               - - ACATCCCTCC TCTATAGAAG CCCCTGAGAG CACAGCTCCT CACCATGGAC TG -              #TACCTGGG     60                                                                  - - GGATCCTCTT CTTGGTGGCA TCTNCCACAG GTAAGGGGCT CCCAAGTCCT AG -             #TGATGAGG    120                                                                  - - AGGGGATTGA GTCCAGTCAA GGGGGCTTTT ATCATCTCCT CCCTTCTCCT CA -             #CAGATGTC    180                                                                  - - CATTCCCAGG TTCAGCTGTT GCAGCCTGGG GCTGAGGTGA AGAAGCCTGC GT -             #CCTCAGTG    240                                                                  - - AAGGTCTCCT GGCCAGGCTT CCAGATACAC CTTCACCAAA TACTTTACAC AG -             #TGGGTGCG    300                                                                  - - ACAGGGCCCT GGACAAGGGC ATAGTGGTTG GGATGCATCA ACCCTTACAA TG -             #ATAACACA    360                                                                  - - CACTACGCAC AGAAGTTCCG GGGCAGAGTC ACCATTACCA GTGACAGGTC CG -             #TGAGCACA    420                                                                  - - GCCTACATGG AGCTGAGCAG TCTGAGATCT GAAGACATGG TCGTGTATTC CT -             #GTGTGAGA    480                                                                  - - GACACAGTGC GAAAACCCAC ATCCTGAGAG TGTCAGAAAC CCCAGGAAGG AG -             #GCACCTGT    540                                                                  - - GCTGACACAG AGGGAGATGA CAAAGATTAT TAGATTAACG ATTTTCTTAG A - #                 591                                                                         - -  - - (2) INFORMATION FOR SEQ ID NO:18:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 539 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                               - - CAAACACCCC TCCTTGGGAG AATCCCCTAG ATCACAGCTC CTCACCATGG AC -              #TGGACCTG     60                                                                  - - GAGCATCCTT TTCTTGGTGG CAGCACCAAC AGGTAACGGA CTCCCCAGTC CC -             #AGGGCTGA    120                                                                  - - GAGAGAAACC AGGCCAGTCA TGTGAGACTT CACCCACTCC TGTGTCCTCT CC -             #ACAGGTGC    180                                                                  - - CCACTCCCAG GTTCAGCTGG TGCAGTCTGG AGCTGAGGTG AAGAAGCCTG GG -             #GCCTCAGT    240                                                                  - - GAAGGTCTCC TGCAAGGCTT CTGGTTACAC CTTTACCAGC TATGGTATCA GC -             #TGGGTGCG    300                                                                  - - ACAGGCCCCT GGACAAGGGC TTGAGTGGAT GGGATGGATC AGCGCTTACA AT -             #GGTAACAC    360                                                                  - - AAACTATGCA CAGAAGCTCC AGGGCAGAGT CACCATGACC ACAGACACAT CC -             #ACGAGCAC    420                                                                  - - AGCCTACATG GAGCTGAGGA GCCTGAGATC TGACGACACG GCCGTGTATT AC -             #TGTGCGAG    480                                                                  - - AGACACAGTG TGAAAACCCA CATCCTGAGG GTTTCAGAAA CCCCAGGGAG GA -             #GGCAGCT     539                                                                  - -  - - (2) INFORMATION FOR SEQ ID NO:19:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 727 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                               - - AGATTTAAGA ACCTTGCACC TGGTACCCGT TGCTCTTCTT GTAACCATTT GT -             #CTTTTAAG     60                                                                  - - TTGTTTATCA CTCTGTAACT ATTTTGATTA TTTTGATTCT TGCATGTTTT TA -             #CTTCTGTA    120                                                                  - - AAATTATTAC ATTTGAGTCC CTCTCCCCTT CCTAAACCTA GGTATAAAAT TT -             #ACTCGAGC    180                                                                  - - CCCTTCCTCG TGGCCGAGAG AATTTTGAGC ATGAGCTGTC TCTTTGGCAG CC -             #GGCTTAAT    240                                                                  - - AAAGGACTCT TAATTCGTCT CAAAGTGTGG CGTTTTCTTA ACTCACCTGG GT -             #ACAACAGT    300                                                                  - - GCAGCTGGTG GAGTCTGGGG GAGGCTTGGT AGAGCCTGGG GGGTCCCTGA GA -             #CTCTCCTG    360                                                                  - - TGCAGCCTCT GGATTCACCT TCAGTAACAG TGACATGAAC TGGGTCCGCC AG -             #GCTCCAGG    420                                                                  - - AAAGGGGCTG GAGTGGGTAT CGGGTGTTAG TTGGAATGGC AGTAGGACGC AC -             #TATGCAGA    480                                                                  - - CTCTGTGAAG GGCCGATTCA TCATCTCCAG AGACAATTCC AGGAACTTCC TG -             #TATCAGCA    540                                                                  - - AATGAACAGC CTGAGGCCCG AGGACATGGC TGTGTATTAC TGTGTGAGAA AC -             #ACTGTGAG    600                                                                  - - AGGACGGAAG TGTGAGCCCA GACACAAACC TCCTGCAGGA ACGTTGGGGG AA -             #ATCAGCTG    660                                                                  - - CAGGGGGCGC TCAAGACCCA CTCATCAGAG TCAACCCCAG AGCAGGTGCA CA -             #TGGAGGCT    720                                                                  - - GGGTTTT                 - #                  - #                        - #         727                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:20:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 514 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                               - - GGACTCGCCA TGGAGTTTGG GCTGAGCTGG GTTTTCCTTG TTGCTATTTT AA -             #AAGGTGAT     60                                                                  - - TCATGGATCA ATAGAGATGT TGAGTGTGAG TGAACACGAG TGAGAGAAAC AG -             #TGGATTTG    120                                                                  - - TGTGGCAGTT TCTGACCAGG GTGTCTCTGT GTTTGCAGGT GTCCAGTGTG AG -             #GTGCAGCT    180                                                                  - - GGTGGAGTCT GGGGGAGGTG TGGTACGGCC TGGGGGGTCC CTGAGACTCT CC -             #TGTGCAGC    240                                                                  - - CTCTGGATTC ACCTTTGATG ATTATGGCAT GAGCTGGGTC CGCCAAGCTC CA -             #GGGAAGGG    300                                                                  - - GCTGGAGTGG GTCTCTGGTA TTAATTGGAA TGGTGGTAGC ACAGGTTATG CA -             #GACTCTGT    360                                                                  - - GAAGGGCCGA TTCACCATCT CCAGAGACAA CGCCAAGAAC TCCCTGTATC TG -             #CAAATGAA    420                                                                  - - CAGTCTGAGA GCCGAGGACA CGGCCTTGTA TCACTGTGCG AGAGACACAG TG -             #AGGGGAAG    480                                                                  - - CCAGTGAGAG CCCAGACACA AACGTCCCTG CAGG       - #                  -      #       514                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:21:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 519 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO : - #21:                          - - AGGATTCACC ATGGAACTGG GGCTCCGCTG GGTTTTCCTT GTTGCTATTT TA -              #GAAGGTGA     60                                                                  - - ATCATGGAAA AGTAGAGAGA TTTAGTGTGT GTGGATATGA GTGAGAGAAA CG -             #GTGGATGT    120                                                                  - - GTGTGACAGT TTCTGACCAA TGTCTCTCTG TTTGCAGGTG TCCAGTGTGA GG -             #TGCAACTG    180                                                                  - - GTGGAGTCTG GGGGAGGCCT GGTCAAGCCT GGGGGGTCCC TGAGACTCTC CT -             #GTGCAGCC    240                                                                  - - TCTGGATTCA CCTTCAGTAG CTATAGCATG AACTGGGTCC GCCAGGCTCC AG -             #GGAAGGGG    300                                                                  - - CTGGAGTGGG TCTCATCCAT TAGTAGTAGT AGTAGTTACA TATACTACGC AG -             #ACTCAGTG    360                                                                  - - AAGGGCCGAT TCACCATCTC CAGAGACAAC GCCAAGAACT CACTGTATCT GC -             #AAATGAAC    420                                                                  - - AGCCTGAGAG CCGAGGACAC GGCTGTGTAT TACTGTGCGA GAGACACAGT GA -             #GGGGAAGT    480                                                                  - - CAGTGTGAGC CCAGACACAA ACCTCCCTGC AGGGGTCCC      - #                       - #   519                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:22:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 606 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                               - - CTACAGCTCT GGGAGAGGAC CCCCAGCCCT GGGATTTTCA GATGTTTTCA TT -              #TGGTGATC     60                                                                  - - AGGACTGAAC ACAGAGGACT CACCATGGAG TCATGGCTGA GCTGGGTTTT TC -             #TTGCCGCT    120                                                                  - - ATTTTAAAAG GTAATTCATT GAGAACTATT GAAATTGAGT GTGAGCGGAT AA -             #GAGTGAGA    180                                                                  - - GAAACAGTGG ATACGTGTGG CAGTTTCTGA CCAGGGTTTC TTTTTGTTTG CA -             #GGTGTCCA    240                                                                  - - GTGTGAGGTG CATCTGGTGG AGTCTGGGGG AGCCTTGGTA CAGCCTGGGG GG -             #TCCCTGAG    300                                                                  - - ACTCTCCTGT GCAGCCTCTG GATTCACCTT CAGTTACTAC TACATGAGCG GG -             #GTCCGCCA    360                                                                  - - GGCTCCCGGG AAGGGGCTGG AATGGGTAGG TTTCATTAGA AACAAAGCTA AT -             #GGTGGGAC    420                                                                  - - AACAGAATAG ACCACGTCTG TGAAAGGCAG ATTCACAATC TCAAGAGATG AT -             #TCCAAAAG    480                                                                  - - CATCACCTAT CTGCAAATGA AGAGCCTGAA AACCGAGGAC ACGGCCGTGT AT -             #TACTGTTC    540                                                                  - - CAGAGACACA GTGAGGGGAG GTCAGTGTGA GCCCGGACAC AAACCTCCCT GC -             #AGGGGCGC    600                                                                  - - GCGGGG                 - #                  - #                  -      #          606                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:23:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 514 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                               - - GAACTCACCA TGGAGTTTGG GCTGAGCTGG CTTTTTCTTG TGGCTAAAAT AA -              #AAGGTAAT     60                                                                  - - TCATGGAGAA ATAGAAAAAT TGAGTGTGAA TGGATAAGAG TGAGAGAAAC AG -             #TGGATACG    120                                                                  - - TGTGGCAGTT TCTGACCAGG GTTTCTTTTT GTTTGCAGGT GTCCAGTGTG AG -             #GTGCAGCT    180                                                                  - - GTTGGAGTCT GGGGGAGGCT TGGTACAGCC TGGGGGGTCC CTGAGACTCT CC -             #TGTGCAGC    240                                                                  - - CTCTGGATTC ACCTTTAGCA GCTATGCCAT GAGCTGGGTC CGCCAGGCTC CA -             #GGGAAGGG    300                                                                  - - GCTGGAGTGG GTCTCAGCTA TTAGTGGTAG TGGTGGTAGC ACATACTACG CA -             #GACTCCGT    360                                                                  - - GAAGGGCCGG TTCACCATCT CCAGAGACAA TTCCAAGAAC ACGCTGTATC TG -             #CAAATGAA    420                                                                  - - CAGCCTGAGA GCCGAGGACA CGGCCGTATA TTACTGTGCG AAAGACACAG TG -             #AGGGGAAG    480                                                                  - - TCATTGTGAG CCCAGACACA AACCTCCCTG CAGG       - #                  -      #       514                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:24:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 600 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:                               - - CCCAGAGACC ATCACACAAC AGCCACATCC CTCCCCTACA GAAGCCCCCA GA -              #GCGCAGCA     60                                                                  - - CCTCACCATG GACTGCACCT GGAGGATCCT CTTCTTGGTG GCAGCAGCTA CA -             #GGCAAGAG    120                                                                  - - AATCCTGAGT TCCAGGTCTG ATGAGGGGAC TGGGTCCAGT TAAGTGGTGT CT -             #CATCCACT    180                                                                  - - CCTCTGTCCT CTCCACAGGC ACCCACGCCC AGGTCCAGCT GGTACAGTCT GG -             #GGCTGAGG    240                                                                  - - TGAAGAAGCC TGGGGCCTCA GTGAAGGTCT CCTGCAAGGT TTCCGGATAC AC -             #CCTCACTG    300                                                                  - - AATTATCCAT GCACTGGGTG CGACAGGCTC CTGGAAAAGG GCTTGAGTGG AT -             #GGGAGGTT    360                                                                  - - TTGATCCTGA AGATGGTGAA ACAATCTACG CACAGAAGTT CCAGGGCAGA GT -             #CACCATGA    420                                                                  - - CCGAGGACAC ATCTACAGAC ACAGCCTACA TGGAGCTGAG CAGCCTGAGA TC -             #TGAGGACA    480                                                                  - - CGGCCGTGTA TTACTGTGCA ACAGACACAG TGTGAAAACC CACATCCTGA GA -             #GCGTCAGA    540                                                                  - - AACCCTGAGG AATGAGGCAG CTGTGCTGAG GCTGAGGAGA TGACAGGATT TA -             #TGAAGTTT    600                                                                  - -  - - (2) INFORMATION FOR SEQ ID NO:25:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 655 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:                               - - ATTCACGTTT TCGAGCTCGG TACCCGGGGG ATCCTCTAGA GTCGACCTGC AG -             #CTCTGGGA     60                                                                  - - GAGGAGCCCA GCCCCCGAAT TCCCAGGTGT TTTCATCTGG TGATCAGCAC CG -             #AACACAGA    120                                                                  - - GGACTCACCA TGGAGTTTGT GCTGAGCTGG GTTTTCCTTG TTGCTATTTT AA -             #AACGTGAT    180                                                                  - - CTATAGAGAA CTAGAGATAT TGAGTATGAA TGGATATGAG TGAGAAACAG TG -             #GATACGTG    240                                                                  - - TGGCAGTTTC TGACCGGGGT GTCTCTGTGT TTGCAGGTAT CCAGTGTGAG AT -             #GCAGCTGG    300                                                                  - - TGGAGTCTGG GGGAGGCTTG CAAAAGCCTG CGTGGTCCCC GAGACTCTCC TG -             #TGCAGCCT    360                                                                  - - CTCAATTCAC CTTCAGTAGC TACTACATGA ACTGTGTCCG CCAGGCTCCA GG -             #GAATGGGC    420                                                                  - - TGGAGTTGGT TTGACAAGTT AATCCTAATG GGGGTAGCAC ATACCTCATA GA -             #CTCCGGTA    480                                                                  - - AGGACCGATT CAATACCTCC AGAGATAACG CCAAGAACAC ACTTCATCTG CA -             #AATGAACA    540                                                                  - - GCCTGAAAAC CGAGGACACG GCCCTCTATT AGTGTACCAG AGACACAGTG AG -             #GGGAGGTC    600                                                                  - - AGTGTGAGCC CAGACACAAA CCTCCCTGCA GGCATGCAAG CTTGGCACTG AC - #CGT              655                                                                        - -  - - (2) INFORMATION FOR SEQ ID NO:26:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 546 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:                               - - AGTGACTCCT GTGCCCCACC ATGGACACAC TTTGCTACAC ACTCCTGCTG CT -              #GACCACCC     60                                                                  - - CTTCCTGTGA GTGCTGTGGT CAGGGACTTC CTCAGAAGTG AAACATCAGT TG -             #TCTCCTTT    120                                                                  - - GTGGGCTTCA TCTTCTTATG TCTTCTCCAC AGGGGTCTTG TCCCAGGTCA CC -             #TTGAAGGA    180                                                                  - - GTCTGGTCCT GTGCTGGTGA AACCCACAGA GACCCTCACG CTGACCTGCA CC -             #GTCTCTGG    240                                                                  - - GTTCTCACTC AGCAATGCTA GAATGGGTGT GAGCTGGATC CGTCAGCCCC CA -             #GGGAAGGC    300                                                                  - - CCTGGAGTGG CTTGCACACA TTTTTTCGAA TGACGAAAAA TCCTACAGCA CA -             #TCTCTGAA    360                                                                  - - GAGCAGGCTC ACCATCTCCA AGGACACCTC CAAAAGCCAG GTGGTCCTTA CC -             #ATGACCAA    420                                                                  - - CATGGACCCT GTGGACACAG CCACATATTA CTGTGCACGG ATACCACAGA GA -             #CACAGCCC    480                                                                  - - AGGATGCCTC CTGTACAAGA ACCTAGCTGC ATCTCAGTGG TGCTCCCTCC CT -             #ACCTCTGC    540                                                                  - - AGAACA                 - #                  - #                  -      #          546                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:27:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 587 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:                               - - TGAGAGCATC ATCCAACAAC CACAACTCTC CTCAGAAGAA GCCCCTAGAC CA -              #CAGCACCT     60                                                                  - - CAACATGTAC TGGACCTGGA GGATCCTCTT CTTGGTGGCA GCAGCAACAG GT -             #AAGGGACC    120                                                                  - - TCCCAGTCAC CGGGCTGAGA GAGAAACCAG GCCAGTCAAG TGAGACTTCA CG -             #CACTCCTG    180                                                                  - - TCTCCTCTCC ACAGGTGTCC ACTCACAGGT GCAGCTGGTG CAGTCTGGGC CT -             #GAGGTGAA    240                                                                  - - GAAGCCTGGA GCCTCATTGA AGGTTTCCTG CAAGGCTTCT GGATACACCT TC -             #ACAAGCTA    300                                                                  - - TGCTATCAGC TGGGTATGAC AGGCCCATGG ACAAGGGCTT GAGGAAATGG GA -             #TGGATCAA    360                                                                  - - CACCAACACT GGGAACCTAA CGTATGCCCA GGGCTTCACA GGACGGTTTG TC -             #TTCTCCAT    420                                                                  - - GGACACCTCC GTCAGCATGG CATATCTTCA TATCAGCAGC CTAAAGGCTG AG -             #GACACGTG    480                                                                  - - CAAGAGGCAC AGTGTGGAAA CCCACATCCT GAGAGAACCA GAAATCCTGA GG -             #GAGGAGGC    540                                                                  - - AGCTGTGCTG AGCTGAGGCA GTGACAGGGA CAACGTGGCT GCACCCT   - #                    587                                                                         - -  - - (2) INFORMATION FOR SEQ ID NO:28:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 624 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:                               - - CATCCCTTTT CACCTCTCCA TACAGAGGCA CCACCCACAT GCAAATCTCA CT -              #TAGGCACC     60                                                                  - - CAAGGGAAAC CATCACACAT TTCCTTAAAT TCAGGGTCCT GCTCACATGG GA -             #AATACTTT    120                                                                  - - CTGAGAGCTC TGGACCTCCT GTGCAAGAAC ATGAAACACC TGTGGTTCTT CC -             #TCCTGCTG    180                                                                  - - GTGGCAGCTC CCAGATGTGA GTGTCTCAAG GCTGCAGACA TGGAGATATG GG -             #AGGTGCCT    240                                                                  - - CTGAGCCCAG GGCTCACTGT GGGTCTCTCT GTTCACAGGG GTCCTGTCCC AG -             #GTGCAGCT    300                                                                  - - GCAGGAGTCG GGCCCAGGAC TGGTGAAGCC TTCGGACACC CTGTCCCTCA CC -             #TGCGCTGT    360                                                                  - - CTCTGGTTAC TCCATCAGCA GTAGTAACTG GTGGGGCTGG ATCCGGCAGC CC -             #CCAGGGAA    420                                                                  - - GGGACTGGAG TGGATTGGGT ACATCTATTA TAGTGGGAGC ACCTACTACA AC -             #CCGTCCCT    480                                                                  - - CAAGAGTCGA GTCACCATGT CAGTAGACAC GTCCAAGAAC CAGTTCTCCC TG -             #AAGCTGAG    540                                                                  - - CTCTGTGACC GCCGTGGACA CGGCCGTGTA TTACTGTGCG AGAAACACAG TG -             #AGGGGAGG    600                                                                  - - TGAGTGTGAG CCCAGACACA AACC          - #                  - #                    624                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:29:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 304 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:                               - - GTCAGATACA CCATGCAGAC TCTGTGAAGG GCAGATTCTC CATCTCCAAA GA -              #CAATGCTA     60                                                                  - - AGAACTCTCT GTATCTGCAA ATGAACAGTC AGAGAACTGA GGACATGGCT GT -             #GTATGGCT    120                                                                  - - GTACATAAGG TTCCAAGTGA GGAAACATCG GTGTGAGTCC AGACACAAAA TT -             #TCCTGCAA    180                                                                  - - AAAGAAGAAA GGAGTCTGGG CCAAAGGGGA CACTCAGCAC TCACAAAACA GG -             #TGCAGCCC    240                                                                  - - CACGGCAGGT GCAGATGGAG GGAGGGTAAG GGCTGNTTTC CTTCAGGATC TG -             #TGGGTTTC    300                                                                  - - CTCT                 - #                  - #                  - #                 304                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:30:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 512 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:                               - - GGACTCACCA TGGAGTTTGG GCTGAGCTGG GTTTTCCTCG TTGCTCTTTT AA -              #GAGGTGAT     60                                                                  - - TCATGGAGAA ATAGAGAGAC TGAGTGTGAG TGAACATGAG TGAGAAAAAC TG -             #GATTTGTG    120                                                                  - - TGGCATTTTC TGATAACGGT GTCCTTCTGT TTGCAGGTGT CCAGTGTCAG GT -             #GCAGCTGG    180                                                                  - - TGGAGTCTGG GGGAGGCGTG GTCCAGCCTG GGAGGTCCCT GAGACTCTCC TG -             #TGCAGCCT    240                                                                  - - CTGGATTCAC CTTCAGTAGC TATGGCATGC ACTGGGTCCG CCAGGCTCCA GG -             #CAAGGGGC    300                                                                  - - TGGAGTGGGT GGCAGTTATA TCATATGATG GAAGTAATAA ATACTATGCA GA -             #CTCCGTGA    360                                                                  - - AGGGCCGATT CACCATCTCC AGAGACAATT CCAAGAACAC GCTGTATCTG CA -             #AATGAACA    420                                                                  - - GCCTGAGAGC TGAGGACACG GCTGTGTATT ACTGTGCGAG AGACACAGTG AG -             #GGGAAGTC    480                                                                  - - ATTGTGCGCC CAGACACAAA CCTCCCTGCA GG       - #                  - #              512                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:31:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 631 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:                               - - CATCCCTTTT CACCTGTCCA TAGAGAGGCA CCAGCCACAT GCAAATCTCA CT -              #TAGGCACC     60                                                                  - - CACAGAAAAC CGCCACACAT TTCCTTAAAA TCAGGGTCCT GCTCACATGG GA -             #AATACTTT    120                                                                  - - CTGAGAGTCC TGGACCTCCT GTGCGAGAAC ATGAAACACC TGTGGTTCTT CC -             #TCCTGCTG    180                                                                  - - GTGGCAGCTC CCAGATGTGA GTGTCTCAAG GCTGCAGACA TGGAGATATG GG -             #AGGTGCCT    240                                                                  - - CTGATCCCAG GGCTCACTGT GTGTCTCTCT GTTCACAGGG GTCCTGCCCC AG -             #GTGCAGCT    300                                                                  - - GCAGGAGTCG GGCCCAGGAC TGGTGAAGCC TTCACAGACC CTGTCCCTCA CC -             #TGTACTGT    360                                                                  - - CTCTGGTGGC TCCATCAGCA GTGGTGGTTA CTACTGGAGC TGGATCCGCC AG -             #CACCCAGG    420                                                                  - - GAAGGGCCTG GAGTGGATTG GGTACATCTA TTACAGTGGG AGCACCTACT AC -             #AACCCGTC    480                                                                  - - CCTCAAGAGT CGAGTTACCA TATCAGTAGA CACGTCTAAG AACCAGTTCT CC -             #CTGAAGCT    540                                                                  - - GAGCTCTGTG ACTGCCGCGG ACACGGCCGT GTATTACTGT GCGAGAGACA CA -             #GTGAGGGG    600                                                                  - - AGGTGAGTGT GAGCCCAGAC ACAAACCTCC C        - #                  - #              631                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:32:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 341 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:                               - - ACCAGTCTCC AGGCAAGGGG CTGGAGTGAG TAATAGATAT AAAAGATGAT GG -              #AAGTCAGA     60                                                                  - - TACACCATGC AGACTCTGTG AAGGGCAGAT TCTCCATCTC CAAAGACAAT GC -             #TAAGAACT    120                                                                  - - CTCTGTATCT GCAAATGAAC ACTCAGAGAG CTGAGGACGT GGCCGTGTAT GG -             #CTATACAT    180                                                                  - - AAGGTCCCAA GTGAGGAAAT ATCGGTGTGA GTCCAGACAC AACATTTCCT GC -             #AAAAAGAA    240                                                                  - - GAAAGGAGTC TGGGCCGAAG GGGACACTCA GCACTCACAA AACAGGTGCA GC -             #CCCACGGC    300                                                                  - - AGGTGCAGAT GGAGGGAGGG TAAGGGCTGC TTTTCCTTCA G    - #                       - #  341                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:33:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 583 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:                               - - TGAACACAGA GGACTCACCA TGGAGTTTGG GCTGAGCTGG GTTTTCCTCG TT -              #GCTCTTTT     60                                                                  - - AAGAGGTGAT TCATTGGAGA AATAGAGAGA CTGAGTGTGA GTGAACATGA GT -             #GAGAAAAA    120                                                                  - - CTGGATTTGT GTGGCATTTT CTGATAACGG TGTCCTTCTG TTTGCAGGTG TC -             #CAGTGTCA    180                                                                  - - GGTACAGCTG GTGGAGTCTG GGGGAGGCGT GGTCCAGCCT GGGAGGTCCC TG -             #AGACTCTC    240                                                                  - - CTGTGCAGCG TCTGGATTCA CCTTCAGTAG CTATGGCATG CACTGGGTCC GC -             #CAGGCTCC    300                                                                  - - AGGCAAGGGG CTGGAGTGGG TGGCAGTTAT ATGGTATGAT GGAAGTAATA AA -             #TACTATGC    360                                                                  - - AGACTCCGCG AAGGGCCGAT TCACCATCTC CAGAGACAAT TCCACGAACA CG -             #CTGTTTCT    420                                                                  - - GCAAATGAAC AGCCTGAGAG CCGAGGACAC GGCTGTGTAT TACTGTGCGA GA -             #GACACAGT    480                                                                  - - GAGGGGAGGT CATTGTGCGC CCAGACACAA ACCTCCCTGC AGGAACGCTG GC -             #GGGAAATC    540                                                                  - - AGCTGCAGGG GGGGCTCAGG AGCCACTGAT CAGAGTCAGC CCT    - #                       - #583                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:34:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 687 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:                               - - AAAAGACTGG GCCCTCCCTC ATCCCTTTTT ACCTATCCAT ACAAAGGCAC CA -              #CCCACATG     60                                                                  - - CAAATCCTCA CTTAGGCACC CACAGGAAAT GACTACACAT TTCCTTAAAT TC -             #AGGGTCCA    120                                                                  - - GCTCACATGG GAAGTGCTTT CTGAGAGTCA TGGACCTCCT GCACAAGAAC AT -             #GAAACACC    180                                                                  - - TGTGGTTCTT CCTCCTCCTG GTGGCAGCTC CCAGATGTGA GTGTCTCAGG AA -             #TGCGGATA    240                                                                  - - TGAAGATATG AGATGCTGCC TCTGATCCCA GGGCTCACTG TGGGTTTCTC TG -             #TTCACAGG    300                                                                  - - GGTCCTGTCC CAGGTGCAGC TACAACAGTG GGGCGCAGGA CTGTTGAAGC CT -             #TCGGAGAC    360                                                                  - - CCTGTCCCTC ACCTGCGCTG TCTATGGTGG GTCCTTCAGT GGTTACTACT GG -             #AGCTGGAT    420                                                                  - - CCGCCAGCCC CCAGGGAAGG GGCTGGAGTG GATTGGGGAA ATCAATCATA GT -             #GGAAGCAC    480                                                                  - - CAACTACAAC CCGTCCCTCA AGAGTCGAGT CACCATATCA GTAGACACGT CC -             #AAGAACCA    540                                                                  - - GTTCTCCCTG AAGCTGAGCT CTGTGACCGC CGCGGACACG GCTGTGTATT AC -             #TGTGCGAG    600                                                                  - - AGGCACAGTG AGGGGAGGTG AGTGTGAGCC CAGACAAAAA CCTCCCTGCA GG -             #TAGGCAGA    660                                                                  - - GGGGGCGGGC GCAGGTACTG CTCAAGA          - #                  - #                 687                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:35:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 700 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:                               - - AAATAGGAGA CATNCAAATA GGCCCCCCCC TTTCCTGATA AAAAGCAGCC CA -              #GTCCTGAC     60                                                                  - - CCTGCAGCCC TGGGAGAGAA GCACCAGCCC TGGGATTCTC AGGTGTTTCC AC -             #TTTGTCAT    120                                                                  - - CAGCAACAAA CAAATTACCA TGGAATTTGG CCTGAGCTGG GTTTTCCTTG CT -             #GCTATTTT    180                                                                  - - AAAAGGTGAT TCATGAAGAA CTAAGGATAT TGAGTGAGTG GACATGAGTG AG -             #AGAAACAG    240                                                                  - - TGGATTTGTG TGGCAGTTTC TGACCAGGGT GTCTCTGTGT TTGCAGGTGT CC -             #AGTGTGAG    300                                                                  - - GTGCAGCTGG TGGAGTCTGG GGGAGGCTTG GTACAGCCTG GGGGATCCCT GA -             #GACTCTCC    360                                                                  - - TGTGCAGCCT CTGGATTCAC CTTCAGTAAC AGTGACATGA ACTGGGTCCA TC -             #AGGCTCCA    420                                                                  - - GGAAAGGGGC TGGAGTGGGT ATCGGGTGTT AGTTGGAATG GCAGTAGGAC GC -             #ACTATGCA    480                                                                  - - GACTCTGTGA AGGGCCGATT CATCATCTCC AGAGACAATT CCAGGAACAC CC -             #TGTATCTG    540                                                                  - - CAAACGAATA GCCTGAGGGC CGAGGACACG GCTGTGTATT ACTGTGTGAG AA -             #ACACTGTG    600                                                                  - - AGAGGTCGGA AGTGTGAGCC CAGACACAAA CCTCCTGCAG GAACGTTGGG GG -             #AAATCAGC    660                                                                  - - TGCAGGGGGC GCTCAGGACC CACTCATCAG AGTCAACCCC     - #                       - #   700                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:36:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 806 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:                               - - TGACACTAAC TCCCCCAGGA TCTCACATCT GCTCTGGANA CGGCTCTCCT GT -              #TGTCCCTA     60                                                                  - - CCCCAGAGCT TGCTATAGAG GAGGAGACAT CCACATAGGG CCCTCNCTTG TC -             #CTGATGAA    120                                                                  - - AACCAGCCTT GCCTGCGTCT ACGGGAGAAG AGCCCCAGTC CAGAAGTACC AG -             #GGGTTTCC    180                                                                  - - ATTTGGTGGT CAGGTCTCTG AACACAGAGG ACTCACTATG GAGTTTGGGC TG -             #AGCTGGGG    240                                                                  - - TTTCCATGTT GCTAATGTAA AAGGTGACTC ATGGAGAACT AGAGATATTG AG -             #TGTGAGTG    300                                                                  - - GACACAAGTG AGAGAAACAG TGGATATGTG TGGCAGGTTC TGACCAGGGT GT -             #CTGTGTGT    360                                                                  - - GTTTGCAGGT GTCCAGTGTG AGGTGCACCT GGTGGAGTCT TTGGGAGGCT TG -             #TTATAGCC    420                                                                  - - TGGGGGTCCC TGAGACTTTC TTTTGCAGCC TCTGGATTCA CCTTTAGTAC CT -             #TTATTAGG    480                                                                  - - TACTGGATGA GCTGGGTCCA TCAGGCTCCT GGGAAAGGGC TGGAGTAGGT CT -             #CATTTATG    540                                                                  - - AGTTGTTGTG TAGGTAGCAC AAGCTATGCA GACTCTGTGA AGGGTCGATT CA -             #CCCTCTCC    600                                                                  - - AGAGATGATG CCAAGAAATC ACTGTATCTG CAAATGAACA GCGTCAGAGC CG -             #AGGATAGG    660                                                                  - - TCTGTGTATT ACTGTGGTGG CATTGTGTGC ATCCCTTGTT TAGGTACATG CA -             #GAGATGCT    720                                                                  - - GCTTTGGTGT GTTCAGGGGC TCCTGTTTTG GGGACACCAA TTTTGGAGTT TG -             #CAGTATCC    780                                                                  - - TTGAGTCCAG TACGTTCATG GTGGCA          - #                  - #                  806                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:37:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 500 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:                               - - GGAATCACCA TGTTGTTTGG ACTGAGCTGG CCGTTCCGAT TTACTATTTT AA -              #GGGGTGAC     60                                                                  - - ACGTGAAGCA CTACAGATAT TGCTCGTGAG TGGATATTAG AGAAACAGTG GA -             #TATGTGTG    120                                                                  - - GCAGTTTCTG ACCAGGATGT CTCTGTGTTT ACAGGTGTGC AGTATGAGGT GC -             #AGCTGGTA    180                                                                  - - GAGTCTGGGG GAGACTTGGT ACAGCTGTGG TGGGTCCTGA GACTCTCATG TG -             #CAGCCTGT    240                                                                  - - GGATTCATCT TGAGAAGCAA CTGGTCCCAC CGGGCTTCAC GAAAGGGGCT GG -             #CATGGAAT    300                                                                  - - GACATGGTCT CATACATTAG TGCTAGTGGT GGTAGTCTAT ACTATGCAGA CA -             #CTGAAGGG    360                                                                  - - TAGATTCACC ATCTCTAGAG ACAATGGCAA GAACATGCTG TTCTTGCAAA TG -             #AACAGTCT    420                                                                  - - GAGAGATGAG GACTCGGTTG TGTTGAGAGA CATGGTGAGG GGAAAATCAG TA -             #TGAGCCCA    480                                                                  - - GCCAGAACTC TCCCTGCAGG            - #                  - #                       - #500                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:38:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 507 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:                               - - CAACTCATCA TGCAGTTTGT GCTGAGCTGG GTTTTCCTTG TTGGTATTTT AA -              #AAGGTGAT     60                                                                  - - TCATGGAGAA CTACAGATGT TGAGTGTGAG TGGACATGAG TGAGCCAAAC AG -             #TGGGTTTG    120                                                                  - - TGTGGCAGTT TCTGACCTGG TGTCTCTGTG TTTACAGGTG TCCAGTGTGA GG -             #TGCAGCTG    180                                                                  - - GTGGAGTCTG GGGGAGGCTT GGTACAGCCT AGGGGGTCCC TGAGACTCTC CT -             #GTGCAGCC    240                                                                  - - TCTGGATTCA CCGTCAGTAG CAATGAGATG AGCTGGATCC GCCAGGCTCC AG -             #GGAAGGGG    300                                                                  - - CTGGAGTGGG TCTCATCCAT TAGTGGTGGT AGCACATACT ACGCAGACTC CA -             #GGAAGGGC    360                                                                  - - AGATTCACCA TCTCCAGAGA CAATTCCAAG AACACGCTGT ATCTTCAAAT GA -             #ACAACCTG    420                                                                  - - AGAGCTGAGG GCACGGCCGC GTATTACTGT GCCAGATATA CACAGAGGGG AA -             #GTCATTGT    480                                                                  - - GCGCCCAGAC ACAAACCTCC CTGTAGG          - #                  - #                 507                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:39:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 800 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:                               - - AGAAGAGGAC TCTGGGCTTG GAGAGGGGAG CCCCCCAAGA AGAGAAACTT GA -              #TTCTCCAA     60                                                                  - - AGGGCACAGC CAGCATTCTC CTCCCAGGGT GAGCTCCAAA AGACTGGCGC CT -             #CTCTCATC    120                                                                  - - CCTTTTCACT GCTCCGTACA AACGCACNCA CCCCCATGCA AATCCTCACT TA -             #GGCGCCCA    180                                                                  - - CAGGAAGCCA CCACACATTT CCTTAAATTC AGGTCCAACT CATAAGGGAA AT -             #GCTTTCTG    240                                                                  - - AGAGTCATGG ATCTCATGTG CAAGAAAATG AAGCACCTGT GGTTCTTCCT CC -             #TGCTGGTG    300                                                                  - - GCGGCTCCCA GATGTGAGTG TTTCTAGGAT GCAGACATGG AGATATGGGA GG -             #CTGCCTCT    360                                                                  - - GATCCCAGGG CTCACTGTGG GTTTTTCTGT TCACAGGGGT CCTGTCCCAG CT -             #GCAGCTGC    420                                                                  - - AGGAGTCGGG CCCAGGACTG GTGAAGCCTT CGGAGACCCT GTCCCTCACC TG -             #CACTGTCT    480                                                                  - - CTGGTGGCTC CATCAGCAGT AGTAGTTACT ACTGGGGCTG GATCCGCCAG CC -             #CCCAGGGA    540                                                                  - - AGGGGCTGGA GTGGATTGGG AGTATCTATT ATAGTGGGAG CACCTACTAC AA -             #CCCGTCCC    600                                                                  - - TCAAGAGTCG AGTCACCATA TCCGTAGACA CGTCCAAGAA CCAGTTCTCC CT -             #GAAGCTGA    660                                                                  - - GCTCTGTGAC CGCCGCAGAC ACGGCTGTGT ATTACTGTGC GAGACACACA GT -             #GAGGGGAG    720                                                                  - - GTGAGTGTGA GCCCAGACAA AAACCTCCCT GCAGGGAGGC TGAGGGGGCG GT -             #CGCAGGTG    780                                                                  - - CAGCTCAGNG CCAGCAGGGG            - #                  - #                       - #800                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:40:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 970 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:                               - - CACAACCTCC ATGAAAAACA ACATAGAAAT TTCTCAAAGA ACTAAAATTA GA -              #ATTACCAT     60                                                                  - - TTCTTCCAGT AAGCTGTCCC AGTAGGCATG TTCCTCCCAA ACTTTTATNT CA -             #GAGAATGT    120                                                                  - - TGCCTGCACT CATATGTTTA TTTCAACACC ATTTTCAATA GAAAAGTCAA AT -             #AATCTAAG    180                                                                  - - TGTCAATCAG TGGATGATTA GATAAAATAT GATATNNATG TAAATCATNG GA -             #ATACTATG    240                                                                  - - CAGCCAGTAT GGTATGAATT CAGTNGTGAN NCCNAGCCCC TGGACAAGNN GG -             #CTTGAGTG    300                                                                  - - GATGGGATGG ATCATCACCT ACACTGGGAA CCCAACATAT ACCAACGGCT TC -             #ACAGGACG    360                                                                  - - GGTTTCTATT CTCCATGGGA CACCTCTGTC AGCATGGCGT ATCTGAAGAT CA -             #GCAGCCTA    420                                                                  - - AAGGCTGAGG ACACGGCCGC GTATGACTGT ATGAGAGACA CAGGGTGGAA AC -             #CCACATCC    480                                                                  - - CGAGGGAGTC AGAAACCCCG GGGGAGGAGC CACCTGTTCT GACCTGAGNC AG -             #TGGTCCAA    540                                                                  - - NCAGTNTCTT TAACNTCCAT ATGATCTCAT TTTTGCATCA TCTTCTACTT TT -             #ATATTAGC    600                                                                  - - TAAGAACTTG GGGTAGACAG GTGCTCCTAA GAGATCCTTA ACTTGCCCAT TT -             #TGATGGGT    660                                                                  - - TTTCCAGAAG ACGTGAGAAG CCACTTTGTT ANCAAAGCAT CCCAAATCCA TG -             #CCCTGTTN    720                                                                  - - CTAGATACAT GTGAGCCCAT TTCCTGGTCT TTGCTTAACT GACAAGCTCT CA -             #TCAGTGCA    780                                                                  - - CCTGGGCTAA TTTCACATCA GGTAGAGGAA CGCGTTATAA AGGAAAGCTA AT -             #GTTGTAAT    840                                                                  - - AGCAATTCCT GCTTAAAAAC CTTCAGCTTC ATTGTTTTTG TGTAATCCAT CA -             #NCAAATTA    900                                                                  - - TGTTAGTTCA AGGTTCTCAA TGGGAGTTTC TAATAAATAG AAAGGATGTA TA -             #AAGCTTGN    960                                                                  - - CACTGNCCGT                - #                  - #                       - #       970                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:41:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 819 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:                               - - CCCCACTCTC TCCTCAGNCG TCCCATCCCA GAGCTTGGCA TTGTAGTAGG AG -              #ACATCCAA     60                                                                  - - ATAGAGCCCT CCCTCTGCTT ATGAAAACCA GCCCAGCCCT GACCCTGCAG CT -             #GTGGGAGA    120                                                                  - - GGAGCCCCAG CCCTGGGATT TTCAGGTGCT TTCATTTTGT GATCAGGACT GA -             #ACACAGAG    180                                                                  - - GATTCACCAT GGAGTCATGG CTGAGCTGGG TTTTTCTTGC CGCTATTTTA AA -             #AGGTAATT    240                                                                  - - CATTGAGAAC TATTGAAATT GAGTGTGAGT GGATAAGAGT GAGATAAACA GT -             #GGATACGT    300                                                                  - - GTGGCAGTTT CTGACCAGGG TTTCTTTGTG TTTGCAGGTG TCCAGTGTGA GG -             #TGCAGCTG    360                                                                  - - GTGGAGTCTG GGGGAGGCTT GGTCCAGCCT GGGGGGTCCC TGAGACTCTC CT -             #GTGCAGCC    420                                                                  - - TCAGGATTCT CCTTTAGTAG CTATGGCATG AGCTGGGTCC GCCAGGCTCC AG -             #GGAAGGGG    480                                                                  - - CTGGAGTGAG TGGCACATAT CTGGAATGAT GGAAGTCAGA AATACTATGC AG -             #ACTCTGTG    540                                                                  - - AAGGGCCGAT TCACAATCTC CGAGACAATT CTAAGAGCAT GCTCTATCTG CA -             #AATGGACA    600                                                                  - - GTCTGAAAGC TAAGGACACG GCCATGTATT ACTGTACCAG ACACAGTGAG AG -             #GAAGTCCG    660                                                                  - - TGTGAGCCCA GACACAAACC TCCCTGCAGG GGCACGCGGG GCCACCAGAG GG -             #TGCCCAGG    720                                                                  - - ATCCCCTGAA GACAGGGACA GNCCAAAGGC AGGTGCAGAT GGNTGTCAAG AG -             #GGTCTTGT    780                                                                  - - GGCTTCGTCT ACATCTAACT GGTTTCCTGG GTGAGCCTC      - #                       - #   819                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:42:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 471 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:                               - - TTGTAGGTGA TTTATGGAGA ATGAGAGATG TTGAGTGCGA GTGGACATGA GT -              #GAGAGAAA     60                                                                  - - CAGTAGATAT GTGTGCCCGT TTCTGACCAG GGTGTCTCTG TGTTTGCAGG CG -             #TCCAGCGT    120                                                                  - - GAGGCGCAGC TGGTGGAGTC TGGGGGAGAC TTGGTACAAC CTGGGTGGGT CC -             #CCGAGACT    180                                                                  - - CTCATTTGCA GCTTCTAGAT TCACCTTCAG TGACTTCTGA ATGCACTGGA TC -             #CGCCAGGC    240                                                                  - - TTCTGGGAAA GGGCTGGAGT GGGTTGGCCG TATTAGAACC AAACGTAACA GT -             #TACACGAC    300                                                                  - - AGAATGCGCT GCATCTGTGA AAGGCAGGTT CACCATCTCA AGAGATGATT CA -             #AAGAACAC    360                                                                  - - ACTGTATCTG CAAGTGAATA CCCTGAAAAC CGAGTACACG GCCATCTATT AC -             #TGTACTAG    420                                                                  - - AGACAGTGAG GGGGAGGTTA ACGTAGGCCC ATACACAAAT CTCCCTGCAG G - #                 471                                                                         - -  - - (2) INFORMATION FOR SEQ ID NO:43:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 870 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:                               - - CATCTGTTAC AGAACTCATT ATATAGTAGG AGACATCCAA ATNGGGTCCC TC -              #CCTCTGCT     60                                                                  - - GATGAAAACC AGCCCAGCCC TGACCCTGCA GCTCTGGGAG AGGAGCCCCA GC -             #CCTGAGAT    120                                                                  - - TCCCAGGTGT TTCCATTCGG TGATCAGCAC TGAACACAGA GAACGCACCA TG -             #GAGTTTGG    180                                                                  - - ACTGAGCTGG GTTTTCCTTG TTGCTATTTT AAAAGGTGAT TCATGGATAA AT -             #AGAGATGT    240                                                                  - - TGAGTGTGAG TGAACATGAG TGAGAGAAAC AGTGGATATG TGTGGCAGTG TC -             #TGACCAGG    300                                                                  - - GTGTCTCTGT GTTTGCAGGT GTCCAGTGTG AAGTGCAGCT GGTGGAGTCT GG -             #GGGAGTCG    360                                                                  - - TGGTACAGCC TGGGGGGTCC CTGAGACTCT CCTGTGCAGC CTCTGGATTC AC -             #CTTTGATG    420                                                                  - - ATTATACCAT GCACTGGGTC CGTCAAGCTC CGGGGAAGGG TCTGGAGTGG GT -             #CTCTCTTA    480                                                                  - - TTAGTTGGGA TGGTGGTAGC ACATACTATG CAGACTCTGT GAAGGGCCGA TT -             #CACCATCT    540                                                                  - - CCAGAGACAA CAGCAAAAAC TCCCTGTATC TGCAAATGAA CAGTCTGAGA AC -             #TGAGGACA    600                                                                  - - CCGCCTTGTA TTACTGTGCA AAAGATACAC AGTGAGGGGA AGTCAGCGAG AG -             #CCCAGACA    660                                                                  - - AAAACCTCGC TGCAGGAAGA CAGGAGGGGC CTGGGCTGCA GAGGCCACTC AA -             #GACACACT    720                                                                  - - GAGCATAGGG TTAACTCTGG GACAAGTTGC TCAGGAAGGT TAAGAGCTGG TT -             #TCCTTTCA    780                                                                  - - GAGTCTTCAC AAATTTCTCC ATCTAACAGT TTCCCCAGGA ACCNGTCTAG AT -             #CTGTGATC    840                                                                  - - TTGGATCTGC TGAAACTGCC TGTGTCACCT         - #                  - #               870                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:44:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 529 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:                               - - TCCCCGGGTA CCGAGCTCAA GTGCCAGGAT TCCCAGGTGT TTTCACTTGG TG -              #ATCAGAAC     60                                                                  - - TTAACACAGA GGACTCACCA TGTTGTTTGG GCTGAGCTGG GCTTTCCTTG TT -             #ACTATTTT    120                                                                  - - AAGAGGTGAT TCATGAAGAA CTACAGATAT TGTTTGTGAG TGGATATTAG AG -             #AAACAGTG    180                                                                  - - GATATGTGTG GCAGTTGCTG ACCAGGATTT CTCTGTGTTT GCAGGTGTGC AG -             #TATGAGGT    240                                                                  - - GCAGCTGGTA GAGTCTTTTT TTTTTTTTTT TTTTCACTTT TTAGCGAACA TC -             #CATGGGTT    300                                                                  - - ACAAAATAAT GGGTTGGCTT TTCTTCCAAC ACTTTACAGA CACCATCAAT TT -             #TCCCCTTG    360                                                                  - - CTTATAAGGT TTTTAACCAG AAGAATGCTG TCATCATCTT TCCTGTTCTT TT -             #AGGAAGAA    420                                                                  - - TGCCCCCTCA ACTCATCTCC ACTTGTCTGC ATGTATTTCT ATTTGTCTTG GA -             #CGTTCCCA    480                                                                  - - ACAGCCTCNC GAACACTCAC CTCACCCTAC AATGCTGCTC GAGGGGGTC  - #                   529                                                                         - -  - - (2) INFORMATION FOR SEQ ID NO:45:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 748 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:                               - - CAGGATCAGG GCTTGAGTCA TCAGCATCTC ACTCTTGCAA AGNCTGATGT GT -              #CGTTTGTC     60                                                                  - - TTCCCTTTCT TATCATCGAC CAGGCTTTGA GCTATGAAAT GCCCTGTCTC AT -             #CAATATNC    120                                                                  - - AAATAACCTG AGATCGACTG AGGTAAATAT GGATATGTCT GTGCCCTGAG AG -             #CATCACCC    180                                                                  - - AACAAACCAC ATCCCTCCTC TAGAGAATCC CCTGAAAGCA CAGCTCCTCA CC -             #ATGGACTG    240                                                                  - - GACCTGGAGA ATCCTCTTCT TGGTGGCAGC AGCCACAGGT AAGGGGCTCC CA -             #AGTCCCAG    300                                                                  - - TGATGAGGAG GGGATTGAGT CCAGTCAAGG TGGCTTTTAT CCACTCCTGT GT -             #CCCCTCCA    360                                                                  - - CAGATGCCTA CTCCCAGATG CAGCTGGTGC AGTCTGGGGC TGAGGTGAAG AA -             #GACTGGGT    420                                                                  - - CCTCAGTGAA GGTTTCCTGC AAGGCTTCCG GATACACCTT CACCTACCGC TA -             #CCTGCACT    480                                                                  - - GGGTGCGACA GGCCCCCGGA CAAGCGCTTG AGTGGATGGG ATGGATCACA CC -             #TTTCAATG    540                                                                  - - GTAACACCAA CTACGCACAG AAATTCCAGG ACAGAGTCAC CATTACCAGG GA -             #CAGGTCTA    600                                                                  - - TGAGCACAGC CTACATGGAG CTGAGCAGCC TGAGATCTGA GGACACAGCC AT -             #GTATTACT    660                                                                  - - GTGCAAGATA CACAGTGTGA AAACCCACAT CCTGAGACCG TCAGAAACCC CA -             #AGGAGGAG    720                                                                  - - GCAGCTTCAC TGAATGAGGA GGTTACAG         - #                  - #                 748                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:46:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 799 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:                               - - CATTTCTTCA AAGCAGGATT AGGGCTTGGA CCATCAGCAT CCCACTCCTG TG -              #TGGCAGAT     60                                                                  - - GGGACATCTA TCTTCTTTCT CCAACCTCGA TCAGGCTTTT GAGGTATGAA AT -             #AATCTGTC    120                                                                  - - TCATGAATAT GCAAATAACC TTAGATCTAC TGAGGTAAAT ATGGATACAT CT -             #GGGCCCTG    180                                                                  - - AAAGCATCAT CCAACAACCA CATCCCTTCT CTACAGAAGC CTCTGAGAGG AA -             #AGTTCTTC    240                                                                  - - ACCATGGACT GGACCTGGAG GGTCTTCTGC TTGCTGGCTG TAGCTCCAGG TA -             #AAGGGCCA    300                                                                  - - ACTGGTTCCA GGGCTGAGGA AGGGATTTTT TCCAGTTTAG AGGACTGTCA TT -             #CTCTACTG    360                                                                  - - TGTCCTCTCC GCAGGTGCTC ACTCCCAGGT GCAGCTGGTG CAGTCTGGGG CT -             #GAGGTGAA    420                                                                  - - GAAGCCTGGG GCCTCAGTGA AGGTTTCCTG CAAGGCATCT GGATACACCT TC -             #ACCAGCTA    480                                                                  - - CTATATGCAC TGGGTGCGAC AGGCCCCTGG ACAAGGGCTT GAGTGGATGG GA -             #ATAATCAA    540                                                                  - - CCCTAGTGGT GGTAGCACAA GCTACGCACA GAAGTTCCAG GGCAGAGTCA CC -             #ATGACCAG    600                                                                  - - GGACACGTCC ACGAGCACAG TCTACATGGA GCTGAGCAGC CTGAGATCTG AG -             #GACACGGC    660                                                                  - - CGTGTATTAC TGTGCGAGAG ACACAGTGTG AGAAACCACA TCCTCAGAGT GT -             #CAGAAACC    720                                                                  - - CTGAGGGAGG AGTCAGCTGT GCTGAGCTGA GAAAATGACA GGGGTTATTC AG -             #TTTAAGAC    780                                                                  - - TGTTTAGAAA ACGGGTTAT             - #                  - #                       - #799                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:47:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 627 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:                               - - CCAAATAAAC ACATTAAATG TCAAGATACG CCCAAAAACT TATCTGCCTG AC -              #CCCCTAGT     60                                                                  - - TGTCTCCGTA ATTTTTGGAT GAAAACCAGC CCACCCCTGA CCCTGCTGCT CT -             #GGGAGAGG    120                                                                  - - AGCCCCAGCC TTGGGATTCC CAAGTGTTTG CATTCAGTGA TCAGGACTGA AC -             #ACACAGGA    180                                                                  - - CTCACCAGGG AGTTTGTGCT AAGCTGGGTT TTCCTTGTTG CTATATTAAA AT -             #GTGATTCA    240                                                                  - - TGGAGAACTA GAGAGATTGA GTGTGAGTTA CATGAGTGAG AGAAACAGTG GA -             #TATGTTTG    300                                                                  - - GCAATTTCTG ACTTTTGTGT CTCTGTGTTT GCAGGTGTCC AGTGTGAGGA TC -             #AGCTGGTG    360                                                                  - - GAGTCTGGGG GAGGCTTGGT ACAGCCTGGG GGGTCCCTGA GACCCTCCTG TG -             #CAGCCTCT    420                                                                  - - GGATTCGCCT TCAGTAGCTA TGTTCTGCAC TGGGTTCGCC GGGCTCCAGG GA -             #AGGGTCCG    480                                                                  - - GAGTGGGTAT CAGCTATTGG TACTGGTGGT GATACATACT ATGCAGACTC CG -             #TGATGGGC    540                                                                  - - CGATTCACCA TCTCCAGAGA CAACGCCAAG AAGTCCTTGT ATCTCAAATG AA -             #CAGCCTGA    600                                                                  - - TAGCTGAGGA CATGGCTGTG TATTATG          - #                  - #                 627                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:48:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 743 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:                               - - AAGGGTCCCC ACCCTAGAGC TTGCTATATA GTAGGAGATA TCCAAATAGG NC -              #CCTCCCTC     60                                                                  - - TACTGATGAA AACCCAACCC AACCCTGACC CTGCAGCTCT CAGAGAGGTG CC -             #TTAGCCCT    120                                                                  - - GGATTCCAAG GCATTTCCAC TTGGTGATCA GCACTGAACA CAGAGGACTC AC -             #CATGGAGT    180                                                                  - - TGGGGCTGTG CTGGGTTTTC CTTGTTGCTA TTTTAGAAGG TGATTCATGG AA -             #AACTAGAG    240                                                                  - - AGATTTAGTG TGTGTGGATA TGAGTGAGAG AAACAGTGGA TATGTGTGGC AG -             #TTTCTGAC    300                                                                  - - CTTGGTGTCT CTTTGTTTGC AGGTGTCCAG TGTGAGGTGC AGCTGGTGGA GT -             #CTGGGGGA    360                                                                  - - GGCTTGGTAC AGCCTGGGGG GTCCCTGAGA CTCTCCTGTG CAGCCTCTGG AT -             #TCACCTTC    420                                                                  - - AGTAGCTATA GCATGAACTG GGTCCGCCAG GCTCCAGGGA AGGGGCTGGA GT -             #GGGTTTCA    480                                                                  - - TACATTAGTA GTAGTAGTAG TACCATATAC TACGCAGACT CTGTGAAGGG CC -             #GATTCACC    540                                                                  - - ATCTCCAGAG ACAATGCCAA GAACTCACTG TATCTGCAAA TGAACAGCCT GA -             #GAGCCGAG    600                                                                  - - GACACGGCTG TGTATTACTG TGCGAGAGAC ACAGTGAGGG GAGGTCAGTG TG -             #ACACCAGA    660                                                                  - - CACAAACCTC CCTGCAGGGG TCCGCAGGAC CACCAGGGGG CGACAGGACA CT -             #GAGCACGG    720                                                                  - - GGCTGTCTCC AGGGCAGGTG CAG           - #                  - #                    743                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:49:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 763 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:                               - - TCACCCAACT CCTCCAGGCA CAGTCATCTT ATCTGGCCCC GTCCTCTCCT CA -              #GNTGTCCC     60                                                                  - - ACCCCAGAGC TTGGTATATA GTAGGAGACA TNCAAATAAG GCCCTCCCTC TG -             #CTGATGAA    120                                                                  - - AATGAGCCCA GCCCTGACCC TGCAGCTCTG GGAGAGGAGC CCCANCCGTG AG -             #ATTCCCAG    180                                                                  - - GAGTTTCCAC TTGGTGATCA GCACTGAACA CAGACCACCA ACCATGGAGT TT -             #GGGCTTAG    240                                                                  - - CTGGGTTTTC CTTGTTGCTA TTTTAAAAGG TAATTCATGG TGTACTAGAG AT -             #ACTGAGTG    300                                                                  - - TGAGGGGACA TGAGTGGTAG AAACAGTGGA TATGTGTGGC AGTTTCTGAC CT -             #TGGTGTTT    360                                                                  - - CTGTGTTTGC AGGTGTCCAA TGTGAGGTGC AGCTGGTGGA GTCTGGGGGA GG -             #CTTGGTAC    420                                                                  - - AGCCAGGGCG GTCCCTGAGA CTCTCCTGTA CAGCTTCTGG ATTCACCTTT GG -             #TGATTATG    480                                                                  - - CTATGAGCTG GTTCCGCCAG GCTCCAGGGA AGGGGCTGGA GTGGGTAGGT TT -             #CATTAGAA    540                                                                  - - GCAAAGCTTA TGGTGGGACA ACAGAATACA CCGCGTCTGT GAAAGGCAGA TT -             #CACCATCT    600                                                                  - - CAAGAGATGG TTCCAAAAGC ATCGCCTATC TGCAAATGAA CAGCCTGAAA AC -             #CGAGGACA    660                                                                  - - CAGCCGTGTA TTACTGTACT AGAGACACAG TGNGGGGAGG TCAATGTGAG CC -             #CAGACACA    720                                                                  - - GACCTCCCTG CAGGCCCGCA CAGAGCCACC AGGGGGCGCT AGG    - #                       - #763                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:50:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 283 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:                               - - TGGCTCACCA TGGAGTTAGG GCTGAGCTGG GTTTCCCTTG TCATTATATT AA -              #AAGGCGAA     60                                                                  - - TAATGGAGAA CTTGAGATAT GGAGTGTGAG TGGATATGAG TGAAGAAACA GT -             #GATTCTGT    120                                                                  - - GTGGCAGGTT CTGACTCAGA TGTCCTCTGT GCTTGTAGGT GTCTAGTGTG GG -             #GTGCAGAT    180                                                                  - - GGTGGAGTCT TGGGGAGAGT TGGCACAANC TGAATGTGCC TGAGACTCTG CC -             #GTGCATCC    240                                                                  - - TCTGAATCCA CCTTCTGTAG CTACTAGATC AGCTGAATCT GCC    - #                       - #283                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:51:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 700 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:                               - - AGGTTCTGGG TTATAAACNC TGTAGACTCC TCCCTTCAGG GCAGGNTGAC CA -              #ACTATGCA     60                                                                  - - AATGCAAGTG GGGGCCTCCC CACTTAAACC CAGGGCTCCC CTCCACAGTG AG -             #TCTCCCTC    120                                                                  - - ACTGCCCAGC TGGGATCTCA GGGCTTCATT TTCTGTCCTC CACCATCATG GG -             #GTCAACCG    180                                                                  - - CCATCCTCGC CCTCCTCCTG GCTGTTCTCC AAGGTCAGTC CTGCCGAGGG CT -             #TGAGGTCA    240                                                                  - - CAGAGGAGAA CGGGTGGAAA GGAGCCCCTG ATTCAAATTT TGTGTCTCCC CC -             #ACAGGAGT    300                                                                  - - CTGTTCCGAG GTGCAGCTGG TGCAGTCTGG AGCAGAGGTG AAAAAGCCCG GG -             #GAGTCTCT    360                                                                  - - GAAGATCTCC TGTAAGGGTT CTGGATACAG CTTTACCAGC TACTGGATCG GC -             #TGGGTGCG    420                                                                  - - CCAGATGCCC GGGAAAGGCC TGGAGTGGAT GGGGATCATC TATCCTGGTG AC -             #TCTGATAC    480                                                                  - - CAGATACAGC CCGTCCTTCC AAGGCCAGGT CACCATCTCA GCCGACAAGT CC -             #ATCAGCAC    540                                                                  - - CGCCTACCTG CAGTGGAGCA GCCTGAAGGC CTCGGACACC GCCATGTATT AC -             #TGTGCGAG    600                                                                  - - ACACACAGTG AGAGAAACCA GCCCCGAGCC CGTCTAAAAC CCTCCACACC GC -             #AGGTGCAG    660                                                                  - - AATGAGCTGC TAGAGACTCA CTCCCCAGGG GCCTCTCTAT     - #                       - #   700                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:52:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 767 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:                               - - ACACTCACCT GCTCTGGGCT CCTCCAAACT CTCCTCAGGA TTCCCCACCC CA -              #GAGCTTGC     60                                                                  - - TATATAGTAG GAGACATCCA AACAAGAGCC NAAACCTCTG CTGATGAAAA GC -             #AGCCCAGC    120                                                                  - - CCTGACCCTG CAGCTCTGGG AGAGGAGCCC CAGCTCCAGG ATTCCCAGGT CT -             #TTCCATTT    180                                                                  - - AGTCTTCAGG GCTGAGCACA GAGGACTCAC CATGGAGTCT GGGCTGAGCT GG -             #GTTTTCCT    240                                                                  - - TGTTGCTATT TTGAAAGGTG ATTCATGGGG AATGAGTTGA ATGTAAGTGA AT -             #ATGAGTGA    300                                                                  - - GAGAAACAGT GGATGTGTGC GGCAGTTTCT GACCAGGGTG TCTCTGTGTT TG -             #CAGGTGTC    360                                                                  - - CAGTGTGAGG TGCAGCTGGT GGAGTCTGGG TGAGGCTTGG TACAGCCTGG AG -             #GGTCCCTG    420                                                                  - - AGACTCTCCT GTGCAGCCTC TGGATTCACC TTCAGTAGCT CCTGGATGCA CT -             #GGGTCTGC    480                                                                  - - CAGGCTCCGG AGAAGGGGCT GGAGTGGGTG GCCGACATAA AGTGTGACGG AA -             #GTGAGAAA    540                                                                  - - TACTATGTAG ACTCTGTGAA GGGCCGATTG ACCATCTCCA GAGACAATGC CA -             #AGAACTCC    600                                                                  - - CTCTATCTGC AAGTGAACAG CCTGAGAGCT GAGGACATGA CCGTGTATTA CT -             #GTGTGAGA    660                                                                  - - GGCACAGTGA GGGGAGGTCA GTGTGAGCCC AGACACAAAC CTCCTGCAGG GG -             #CATCTGGA    720                                                                  - - GCCACAAGGG GGCGCTCAGG ATACACAGAG GGACAGGGGC AGCCCCA   - #                    767                                                                         - -  - - (2) INFORMATION FOR SEQ ID NO:53:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 724 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:                               - - CCATTCGGTG ATCAGCACTG AACACAGAGG ACTCACCATG GAGTTTTGGC TG -              #AGCTGGGT     60                                                                  - - TTTCCTTGTT GCTATTTTAA AAGGTGATTC ATGGAGAACT AGAGATATTG AG -             #TGTGAGTG    120                                                                  - - AACACGAGTG AGAGAAACAG TGGATATGTG TGGCAGTTTC TAACCAATGT CT -             #CTGTGTTT    180                                                                  - - GCAGGTGTCC AGTGTGAGGT GCAGCTGGTG GAGTCTGGAG GAGGCTTGAT CC -             #AGCCTGGG    240                                                                  - - GGGTCCCTGA GACTCTCCTG TGCAGCCTCT GGGTTCACCG TCAGTAGCAA CT -             #ACATGAGC    300                                                                  - - TGGGTCCGCC AGGCTCCAGG GAAGGGGCTG GAGTGGGTCT CAGTTATTTA TA -             #GCGGTGGT    360                                                                  - - AGCACATACT ACGCAGACTC CGTGAAGGGC CGATTCACCA TCTCCAGAGA CA -             #ATTCCAAG    420                                                                  - - AACACGCTGT ATCTTCAAAT GAACAGCCTG AGAGCCGAGG ACACGGCCGT GT -             #ATTACTGT    480                                                                  - - GCGAGAGACA CAGTGAGGGG AAGTCATTGT GCGCCCAGAC ACAAACCTCC CT -             #GCAGGAAC    540                                                                  - - GCTGGGGGGA AATCAGCGGN AGGGGGCGCT CAGGAGCCAC TGATCAGAGT CA -             #GCCCCGGA    600                                                                  - - GGCAGGTGCA GATGGAGGCT GATTTCCTTG TCAGGATGTG GGGACTTTTG TC -             #TTCTTCTG    660                                                                  - - ACGGGTTCCC CAGGGGAACC TCTCTAAGTT TAGCATTCTG TGCCTATGAA CG -             #TCTTCTCT    720                                                                  - - AAGT                 - #                  - #                  - #                 724                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:54:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 706 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:54:                               - - CTTGCTATAC AGTAGGAGAC ATGCNAATAG GTTTCTCCCT CTGCTGATGA CC -              #AGTCCTGA     60                                                                  - - CCCCATAGCT CTGGGAGAGA AGCGCCAGCC CTGGGATTCC CAGGGGTTTC CA -             #TTTGGTGA    120                                                                  - - TCAGGACTAA AGACAGAGGA CCCACCATGG AGCTTGGGCT GAGCTGGGTT TT -             #CACTGTTA    180                                                                  - - CTGTTTTAAA AGGTGAACTA GAGAGATTGA GTGTGAATGG ATACACTTGA GA -             #GAAACAGT    240                                                                  - - GGATATGTCT GGAACTTTCT GACCAGGACA CCTACAAGTT TGCAGGTGTC CA -             #GTGTGAGG    300                                                                  - - TACAGCTGGT GGAGTCTGAA GAAAACCAAA GACAACTTGG GGGATCCCTG AG -             #ACTCTCCT    360                                                                  - - GTGCAGACTC TGGATTAACC TTCAGTAGCT ACTGAATGAG CTCAGATTCC CA -             #AGCTCCAG    420                                                                  - - GGAAGGGGCT GGAGTGAGTA GTAGATATAT AGTAGGATAG AAGTCAGCTA TG -             #TTATGCAC    480                                                                  - - AATCTGTGAA GAGCAGATTC ACCATCTCCA AAGAAAATGC CAAGAACTCA CT -             #CTGTTTGC    540                                                                  - - AAATGAACAG TCTGAGAGCA GAGGGCACGG CCGTGTATTA CTGTATGTGA GT -             #CACCAGGT    600                                                                  - - AAGAAGACAT CAGTGTGATC ACAGACACAG AATTTCCTGA AATAAGGGAG GA -             #GTCTGGGC    660                                                                  - - TAAAAGGGCA CTCAGGACCC ACAGAAAACA GCGGAAGCTC TAGGGC   - #                     706                                                                         - -  - - (2) INFORMATION FOR SEQ ID NO:55:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 800 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:                               - - GGAAGAGANC TTGATTCTCA AGAGGGCACA GCCAGCTTCC TACTCCCAGG GC -              #AAGCCCCA     60                                                                  - - AAAGACTGGG NCCTCCCTCC TCCCTTTTCA CCTGTCCATA CAAAGTCACC GC -             #CCACATGC    120                                                                  - - AAATCCTCAC TTAGGCACCT ACAGGAAACC AGCACACATT TCCTTAAATT TG -             #GGATCCAG    180                                                                  - - CTCACATGGG AAATACTTTC TGAGACTCAT GGGCCTCCTG CACAAGAACA TG -             #AAACACCT    240                                                                  - - GTGGTTCTTC CTCCTGCTGG TGGCAGCTCC CAGATGTGAG TGCCTCAGGG AT -             #CCAGACCT    300                                                                  - - GAAGATATGA GATGCTGCCT CTCATCCCAG GGCTCACCGT GGTTCTCTCT GT -             #TCACAGGG    360                                                                  - - GTCCTGTCCC AGGTGCAGCT GCAGGAGTCG GGCCCAGGAC TGGTGAAGCC TT -             #CGGAGACC    420                                                                  - - CTGTCCCTCA TCTGCGCTGT CTCTGGTGAC TCCATCAGCA GTGGTAACTG GT -             #GAATCTGG    480                                                                  - - GTCCGCCAGC CCCCAGGGAA GGGGCTGGAG TGGATTGGGG AAATCCATCA TA -             #GTGGGAGC    540                                                                  - - ACCTACTACA ACCCGTCCCT CAAGAGTCGA ATCACCATGT CCGTAGACAC GT -             #CCAAGAAC    600                                                                  - - CAGTTCTACC TGAAGCTGAG CTCTGTGACC GCCGCGGACA CGGCCGTGTA TT -             #ACTGTGCG    660                                                                  - - AGATACACAG TGAGGGGAGG TGAGTGTGAG CCCAGACACA AACCTCCCTA CA -             #GATAGGCA    720                                                                  - - GAGGGGGNGG GCACAGGTGC TGCTCAGGAN CAACAGGGGG CGCGCGANGN CA -             #CAGAGCCC    780                                                                  - - GAGGNCCGGG TCANGAGCAG            - #                  - #                       - #800                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:56:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 429 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:                               - - AGGAATTGGG CTATTCAATG CATCCTTCGT GAATATGCAA ATCACTAAGG TT -              #AATACAGA     60                                                                  - - TATCTCTGTG CCGTGAGAGC ATCACCCAAC AACCACACCC CTCCTTGGAG AA -             #TCCCTAGA    120                                                                  - - TCACAGCTCC TCACCATGGA CTGGACCTGG AGCATCCTCT TCTTGGTGGC AG -             #CAGCAACA    180                                                                  - - GGTAAGGACT CCCCAGTCCC AGGGCTGAGG GAGAAACCAG GCCAGTCATG TG -             #AGACTTCA    240                                                                  - - CCCACTGCTG TCTCCTCTCC ACAGGTGCCC ACTCCCGAGT GCAGCTGGTG CA -             #GTCTGGGC    300                                                                  - - CTGAGGTGAA GCAGCCTGGG GCCTCGGCGA AGGTCTCCTG CAAGGTGTCT GG -             #TTAAACTG    360                                                                  - - TCATCACCTA TGGTATGAAT TGGATACGAC AGACCCCAGG ACAGGGGCTT GA -             #GTGGATGG    420                                                                  - - GATGGATCC                - #                  - #                       - #        429                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:57:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 462 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:                               - - CATCAGTTGC GCTCAGGAGT TTTAGAACAG CCTGGCAACA CATTTAGATC TG -              #GGCTTCCC     60                                                                  - - TTCTCATCAC CCTCAATATT AGTGTCCCTT GTGAATCAGG TCCAGCTGCG GC -             #TGTTCCAC    120                                                                  - - ATGGGGCCGT TCTTCCATTT CCTCAGTGTT TGCAGAAGTC CTGTGTGAAG TT -             #TATTGATG    180                                                                  - - GAGTCAGAGG CAGAAAATTG TACAGCCCAG TGGTTCACTG AGACTCTCCT GC -             #AAAGGCTC    240                                                                  - - TGATTTCACC TTTACTGGCT ACAGCATGAG CTTGGTCCAG CAGGCTTCAT GA -             #CAGGGATT    300                                                                  - - GGTGTGGGTG GAAACAGTGA GTAGTCAAGT GGGAGTTCTC AGAGTTACTC TC -             #CATGAGTA    360                                                                  - - CAAATAAATT AACAGTCCCA AGCGACACCT TTTCATGTGC AGTCTACCTT AC -             #AATGACCA    420                                                                  - - ACCTGAAAGT CCAAGGACAA GGCTGTGTAT TACTGTGAGG GA    - #                       - # 462                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:58:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 629 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:58:                               - - AGGGTCTTCA GCTATGAAAT GCTCTGACTC ATGAATATGC AAATAACCTG AG -              #ATGCACTG     60                                                                  - - AGGTAAATAT GGATATTTGT CAGCCCTGAG AGCATCATCC AGAAACCACA TC -             #CCTCCGCT    120                                                                  - - AGAGAAGCCC TGACGGCACA GTTCCTCACT ATGGACTGGA TTTGGAGGAT CC -             #TCTTCTTG    180                                                                  - - GTGGGAGCAG CGACAGGCAA GGAGATGCCA AGTCCCAGTG ATGAGGAGGG GA -             #TTGAGTCC    240                                                                  - - AGTCAAGGTG GCTTTCATCC ACTCCTGTGT TCTCTCCACA GGTGCCCACT CC -             #CAAATGCA    300                                                                  - - GCTGGTGCAG TCTGGGCCTG AGGTGAAGAA GCCTGGGACC TCAGTGAAGG TC -             #TCCTGCAA    360                                                                  - - GGCTTCTGGA TTCACCTTTA CTAGCTCTGC TGTGCAGTGG GTGCGACAGG CT -             #CGTGGACA    420                                                                  - - ACGCCTTGAG TGGATAGGAT GGATCGTCGT TGGCAGTGGT AACACAAACT AC -             #GCACAGAA    480                                                                  - - GTTCCAGGAA AGAGTCACCA TTACCAGGGA CATGTCCACA AGCACAGCCT AC -             #ATGGAGCT    540                                                                  - - GAGCAGCCTG AGATCCGAGG ACACGGCCGT GTATTACTGT GCGGCAGACA CA -             #GTGTGAAA    600                                                                  - - ACCCACATCC TGAGAGTGTC AGAAACGCC         - #                  - #                629                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:59:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 622 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:59:                               - - CCTCCTTTTT CACCTCTCCA TACAAAGGCA CCACCCACAT GCAAATCCTC AC -              #TTAAGCAC     60                                                                  - - CCACAGGAAA CCACCACACA TTTCCTTAAA TTCAGGTTCC AGCTCACATG GG -             #AAATACTT    120                                                                  - - TCTGAGAGCT CTGGACCTCC TGTGCAAGAA CATGAAACAT CTGTGGTTCT TC -             #CTTCTCCT    180                                                                  - - GGTGGCAGCT CCCAGATGTG AGTATCTCAG GGATCCAGAC ATGGGGATAT GG -             #GAGGTGCC    240                                                                  - - TCTGATCCCA GGGCTCACTG TGGGTCTCTC TGTTCACAGG GGTCCTGTCC CA -             #GGTGCAGC    300                                                                  - - TGCAGGAGTC GGGCCCAGGA CTGGTGAAGC CTTCGGAGAC CCTGTCCCTC AC -             #CTGCACTG    360                                                                  - - TCTCTGGTGG CTCCGTCAGT AGTTACTACT GGAGCTGGAT CCGGCAGCCC CC -             #AGGGAAGG    420                                                                  - - GACTGGAGTG GATTGGGTAT ATCTATTACA GTGGGAGCAC CAACTACAAC CC -             #CTCCCTCA    480                                                                  - - AGAGTCGAGT CACCATATCA GTAGACACGT CCAAGAACCA GTTCTCCCTG AA -             #GCTGAGCT    540                                                                  - - CTGTGACCGC TGCGGACACG GCCGTGTATT ACTGTGCGAG AGACACAGTG AG -             #GGGAGGTG    600                                                                  - - AGTGTGAGCC CAGACAAAAA CC           - #                  - #                     622                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:60:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 588 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:                               - - CCCGGGATTC CCAGCTGTCT CCACTTGGTC ATGAACACTG AACACAGAAG AC -              #ACACCATG     60                                                                  - - GAGTCTGGGC TGAGCTGGAT TTTCCTTGTT GCAGTTTTAA AAGGTGATTT AT -             #GGAGAATA    120                                                                  - - GACACACTGA GTGTGACTGG ACATAAGTGA GAGAAACAGT GGATTTGTGT GG -             #CAGTTTCT    180                                                                  - - GACCAGGGTG TCTCCGTGTT TGCAGGTGTC CAGTGTGAGG TGCAGCTGGT GG -             #AGTCTGGG    240                                                                  - - GGAGGCTTAG TAAAGACTGG GGGGTCTCTG AGACTCTCCT GTGCAGCCTC TG -             #GATTCACC    300                                                                  - - TTCAGTAGCT CTGCTATGCA CTGGGTCCAC CAGGCTCCAG GAAAGGGTTT GG -             #AGTGGGTC    360                                                                  - - TCAGTTATTA GTACAAGTGG TGATACCGTA CTCTACACAG ACTCTGTGAA GG -             #GCTGATTC    420                                                                  - - ACCATCTCTA GAGACAATGC CCAGAATTCA CTGTATCTGC AAATGAACAG CC -             #TGAGAGCC    480                                                                  - - GACGACATGG CTGTGTATTA CTGTGTGAAA GACGCAGTGA GAAGTCAGTG TG -             #AGCCCAGA    540                                                                  - - CACAAACCTC CTGCAGGGTA CCTGGGACAA CCAGGGAAAG CCTGGGAC  - #                    588                                                                         - -  - - (2) INFORMATION FOR SEQ ID NO:61:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1212 base - #pairs                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:61:                               - - CCTCCTTTTT CACCTCTCCG TACAAAGGCA CCACCCACAT GCAAATCCTT AC -              #TTAAGCAC     60                                                                  - - CCACAGGAAA CCACCACACA TTTCCTTAAA TTCAGGTTCC AGCTCACATG GG -             #AAATACTT    120                                                                  - - TCTGAGAGCC TGGACCTCCT GTGCAAGAAC ATGAAACACC TGTGGTTCTT CC -             #TCCTCCTG    180                                                                  - - GTGGCAGCTC CCAGATGTGA GTGTCTCAGG GATCCAGACA TGGGGGTATG GG -             #AGGTGCCT    240                                                                  - - CTGATCCCAG GGCTCACTGT GGGTCTCTCT GTTCACAGGG GTCCTGTCCC AG -             #GTGCAGCT    300                                                                  - - GCAGGAGTCG GGCCCAGGAC TGGTGAAGCC TTCGGAGACC CTGTCCCTCA CC -             #TGCACTGT    360                                                                  - - CTCTGGTGGC TCCGTCAGCA GTGGTAGTTA CTACTGGAGC TGGATCCGGC AG -             #CCCCCAGG    420                                                                  - - GAAGGGACTG GAGTGGATTG GGTATATCTA TTACAGTGGG AGCACCAACT AC -             #AACCCCTC    480                                                                  - - CCTCAAGAGT CGAGTCACCA TATCAGTAGA CACGTCCAAG AACCAGTTCT CC -             #CTGAAGCT    540                                                                  - - GAGCTCTGTG ACCGCTGCGG ACACGGCCGT GTATTACTGT GCGAGAGACA CA -             #GTGAGGGG    600                                                                  - - AGGTGAGTGT GAGCCCAGGA CACAAACCTC CCTCATGGAC GCGGAGGGGA CC -             #GGCGCAGG    660                                                                  - - TGCTGCTCAG GACCAGCAGG TGGCGCGCGG GGCCCCCAGA GCATGAGGCC GG -             #GTCAGGAC    720                                                                  - - AGGTGCAGGG AGGGCTTCCT CATCTGCTCA CTGGTCTCCG TCCTCGCCAG CA -             #CCTCGCTG    780                                                                  - - TCACCAGGGC TCCTCTTTCT TTATTATCTG TGGTTCTGCT TCCTCACATT CT -             #TGTGCCAG    840                                                                  - - GAAAGAAACG AGGAAGACGG GTTTTCGTCT ATAGTTGAAG CTTTTACTAG GA -             #TCTTGCCT    900                                                                  - - ACAAGTTCCT GCATGACCCA TTATAACTTA TCGATTAAAA AATATATATT CT -             #AATGCTTC    960                                                                  - - TCACCATCTC TTGATTTGTA TCATCAACTG AATTGTACCC TCTTTGAAAT TC -             #ATATGATG   1020                                                                  - - AAACCTTAAA TTCAATGGAT CTATATTGGA ATTTTAATGA AATAATTAAG GT -             #TAAATGTG   1080                                                                  - - GTCATAATTG TAAGACCCTA ATGCAATAGA CGTGTTGTCT TTATAAGAAG AG -             #GAAGAGAC   1140                                                                  - - ACCAGAGACC TCTCACTTTT CACGTGCAGG CAGAGAAGAG GCCATGTGGA GA -             #CATAGTGC   1200                                                                  - - ACTAGAAGGT GG              - #                  - #                       - #     1212                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:62:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 560 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:                               - - GATCAGCACT GAACACAGAG GACTCACCAT GGAGTTTGGG CTGAGCTGGG TT -              #TTCCTTGT     60                                                                  - - TGCTAATTTA AGAGGTGATT CATAGATAAA TAGAGATGTT GAGTGGGAGT GG -             #ACATGAGT    120                                                                  - - GAGAGAAACA GTGGATGTGT GTGGCAGTTT CTGACCTTGG TGTCTTTGTG TT -             #TGCAGGTG    180                                                                  - - TCCAGTGTGA GGTGCAGCTG GTGGAGTCTG GGGAAGGCTT GGTCCAGCCT GG -             #GGGGTCCC    240                                                                  - - TGAGACTCTC CTGTGCAGCC TCTGGATTCA CCTTCAGTAG CTCTGCTATG CA -             #CTGGGTCC    300                                                                  - - GCCAGGCTCC AAGAAAGGGT TTGTAGTGGG TCTCAGTTAT TAGTACAAGT GG -             #TGATACCG    360                                                                  - - TACTCTACAC AGACTCTGTG AAGGGCCGAT TCACCATCTC CAGAGACAAT GC -             #CCAGAATT    420                                                                  - - CACTGTCTCT GCAAATGAAC AGCCTGAGAG CCGAGGGCAC AGTTGTGTAC TA -             #CTGTGTGA    480                                                                  - - AAGACGCAGT GAGAAGTCAG TGTGAGCCCA GACACAAACC TCCTGCAGGG TA -             #CCTGGGAC    540                                                                  - - AATCAGGGAA AGCCTGGGAC            - #                  - #                       - #560                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:63:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 515 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:                               - - GAGCTCACTA TGGGGTTTGA GCTAACCAGA ATTTTTCTTG TTGCTATTTT AA -              #AAGGTGAC     60                                                                  - - TCATAGAGAA ATAGAGTGAG TGAGAGTGAG TGGATATAAG TGAGAAAAAC AG -             #TAGATGTG    120                                                                  - - TTTGGCAGTT TCTGACCAGG ACGTTTGTGT ATTTTCAGGT GTTCAGTGTG AG -             #GTGGAGCT    180                                                                  - - GATAGAGTCC ATAGAGGGCC TGAGACAACT TGGGAAGTTC CTGAGACTCT CC -             #TGTGTAGC    240                                                                  - - CTCTGGATTC ACCTTCAGTA GCTACTGAAT GAGCTGGGTC AATGAGACTC TA -             #GGGAAGGG    300                                                                  - - GCTGGAGGGA GTAATAGATG TAAAATATGA TGGAAGTCAG ATATACCATG CA -             #GACTCTGT    360                                                                  - - GAAGGGCAGA TTCACCATCT CCAAAGACAA TGCTAAGAAC TCACCGTATC TC -             #CAAACGAA    420                                                                  - - CAGTCTGAGA GCTGAGGACA TGACCATGCA TGGCTGTACA TAAGGTTCCA AG -             #TGAGGAAA    480                                                                  - - CATCGGTGTG AGTCCAGACC AAAATTTCCT GCAGG       - #                        - #      515                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:64:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 649 base - #pairs                                                  (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Homo sapi - #ens                                                 (G) CELL TYPE: human - #lymphoblast                                            (H) CELL LINE: CGM1                                                   - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:                               - - AGCTCTGGGA GAGGAGCCCC CCCCCTGGGA TTCCCAGGTG TTTTCATTTG GT -             #GATCAGCA     60                                                                  - - CTGAACACAG AAGAGTCATG ACGGAGTTTG GGCTGAGCTG GGTTTTCCTT GT -             #TGCTATTT    120                                                                  - - TTAAAGGTGA TTCATGAGGA AATAGAGATA TTGAGTGTGA GTGGACATGA GT -             #GAGAGAAA    180                                                                  - - CAGTGGATTT GTGTGGCAGT TTCTGACCTT GGTGTCTCTG TGTTTGCAGG TG -             #TCCAGTGT    240                                                                  - - GAGGTGCAGC TGGTGGAGTC TGGGGGAGGC TTGGTCCAGC CTGGGGGGTC CC -             #TGAGACTC    300                                                                  - - TCCTGTGCAG CCTCTGGATT CACCTTCAGT AGCTATGCTA TGCACTGGGT CC -             #GCCAGGCT    360                                                                  - - CCAGGGAAGG GACTGGAATA TGTTTCAGCT ATTAGTAGTA ATGGGGGTAG CA -             #CATATTAT    420                                                                  - - GCAAACTCTG TGAAGGGCAG ATTCACCATC TCCAGAGACA ATTCCAAGAA CA -             #CGCTGTAT    480                                                                  - - CTTCAAATGG GCAGCCTGAG AGCTGAGGAC ATGGCTGTGT ATTACTGTGC GA -             #GAGACACA    540                                                                  - - GTGAGGAGAA GTTAATGTGG GACCATGCAG AAACCTCCCT GCGGGAACGC TG -             #GGGAAAGT    600                                                                  - - CATCTGCAGG GGGCGCTCAG GAGCCACTGA TCAGCGTCAA CCGCAGCGG  - #                   649                                                                         - -  - - (2) INFORMATION FOR SEQ ID NO:65:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:65:                               - - AGGTGCAGCT GGTGCAGTCT G           - #                  - #                       - #21                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:66:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 20 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:66:                               - - CCAGGGGCCT GTCGCACCCA            - #                  - #                       - # 20                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:67:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:67:                               - - TGGGGCCTCA GTGAAGGTCT CCTG          - #                  - #                     24                                                                       - -  - - (2) INFORMATION FOR SEQ ID NO:68:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:                               - - GATCCATCCC ATCCACTCAA G           - #                  - #                       - #21                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:69:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:69:                               - - GATCCGTCCC ATCCACTCAA G           - #                  - #                       - #21                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:70:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:                               - - TGTCTTCTCC ACAGGGGTCT T           - #                  - #                       - #21                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:71:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 20 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:71:                               - - GGGAAGGCCC TGGAGTGGCT            - #                  - #                       - # 20                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:72:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 20 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:                               - - GTGCAGGTCA GCGTGAGGGT            - #                  - #                       - # 20                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:73:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 22 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:73:                               - - TGGTTTTTGG AGGTGTCCTT GG           - #                  - #                      22                                                                       - -  - - (2) INFORMATION FOR SEQ ID NO:74:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 23 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:74:                               - - CACTCCAGCC CCTTCCCTGG AGC           - #                  - #                     23                                                                       - -  - - (2) INFORMATION FOR SEQ ID NO:75:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:75:                               - - GTGAGGTTCA GCTGGTGGAG T           - #                  - #                       - #21                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:76:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 20 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:76:                               - - AGCTGAACCT CACACTGGAC            - #                  - #                       - # 20                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:77:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 19 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:77:                               - - AAGGGCCGAT TCACCATCT             - #                  - #                       - # 19                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:78:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 20 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:                               - - TTGTCTCTGG AGATGGTGAA            - #                  - #                       - # 20                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:79:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 24 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:79:                               - - TGAGACTCTC CTGTGCAGCC TCTG          - #                  - #                     24                                                                       - -  - - (2) INFORMATION FOR SEQ ID NO:80:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 19 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:80:                               - - TCTTTGTGTT TGCAGGTGT             - #                  - #                       - # 19                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:81:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 19 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:81:                               - - TCTCTGTGTT TGCAGGTGT             - #                  - #                       - # 19                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:82:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:82:                               - - TCTGTTCACA GGGGTCCTGT C           - #                  - #                       - #21                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:83:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 19 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:83:                               - - TCCGGCAGCC CCCAGGGAA             - #                  - #                       - # 19                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:84:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 18 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:84:                               - - GCAGGTGAGG GACAGGGT             - #                  - #                       - #  18                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:85:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:85:                               - - CAGGGAGAAC TGGTTCTTGG A           - #                  - #                       - #21                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:86:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:86:                               - - CCCGGGCATC TGGCGCACCC A           - #                  - #                       - #21                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:87:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:87:                               - - GCTGCTCCAC TGCAGGTAGG C           - #                  - #                       - #21                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:88:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 21 base - #pairs                                                   (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: Genomic DNA                                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:88:                               - - CTTCAGGCTG CTCCACTGCA G           - #                  - #                       - #21                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:89:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 121 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:89:                               - - Met Ser Val Ser Phe Leu Ile Phe Leu Pro Va - #l Leu Gly Leu Pro Trp         1               5 - #                 10 - #                 15               - - Gly Val Leu Ser Gln Val Gln Leu Gln Gln Se - #r Gly Pro Gly Leu Val                    20     - #             25     - #             30                   - - Lys Pro Ser Gln Thr Leu Ser Leu Thr Cys Al - #a Ile Ser Gly Asp Ser                35         - #         40         - #         45                       - - Val Ser Ser Asn Ser Ala Ala Trp Asn Trp Il - #e Arg Gln Ser Pro Ser            50             - #     55             - #     60                           - - Arg Gly Leu Glu Trp Leu Gly Arg Thr Tyr Ty - #r Arg Ser Lys Trp Tyr        65                 - # 70                 - # 75                 - # 80        - - Asn Asp Tyr Ala Val Ser Val Lys Ser Arg Il - #e Thr Ile Asn Pro Asp                        85 - #                 90 - #                 95               - - Thr Ser Lys Asn Gln Phe Ser Leu Gln Leu As - #n Ser Val Thr Pro Glu                   100      - #           105      - #           110                   - - Asp Thr Ala Val Tyr Tyr Cys Ala Arg                                               115          - #       120                                              - -  - - (2) INFORMATION FOR SEQ ID NO:90:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:90:                               - - Met Asp Trp Thr Trp Arg Ile Leu Phe Leu Va - #l Ala Ala Ala Thr Gly         1               5 - #                 10 - #                 15               - - Ala His Ser Gln Val Gln Leu Val Gln Ser Gl - #y Ala Glu Val Lys Lys                    20     - #             25     - #             30                   - - Pro Gly Ala Ser Val Lys Val Ser Cys Lys Al - #a Ser Gly Tyr Thr Phe                35         - #         40         - #         45                       - - Thr Gly Tyr Tyr Met His Trp Val Arg Gln Al - #a Pro Gly Gln Gly Leu            50             - #     55             - #     60                           - - Glu Trp Met Gly Trp Ile Asn Pro Asn Ser Gl - #y Gly Thr Asn Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Gln Lys Phe Gln Gly Arg Val Thr Met Thr Ar - #g Asp Thr Ser Ile Ser                        85 - #                 90 - #                 95               - - Thr Ala Tyr Met Glu Leu Ser Arg Leu Arg Se - #r Asp Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:91:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:91:                               - - Met Asp Trp Thr Trp Arg Ile Leu Phe Leu Va - #l Ala Ala Ala Thr Gly         1               5 - #                 10 - #                 15               - - Val His Ser Gln Val Gln Leu Val Gln Ser Gl - #y Ala Glu Val Lys Lys                    20     - #             25     - #             30                   - - Pro Gly Ala Ser Val Lys Val Ser Cys Lys Al - #a Ser Gly Tyr Thr Phe                35         - #         40         - #         45                       - - Thr Ser Tyr Ala Met His Trp Val Arg Gln Al - #a Pro Gly Gln Arg Leu            50             - #     55             - #     60                           - - Glu Trp Met Gly Trp Ser Asn Ala Gly Asn Gl - #y Asn Thr Lys Tyr Ser        65                 - # 70                 - # 75                 - # 80        - - Gln Glu Phe Gln Gly Arg Val Thr Ile Thr Ar - #g Asp Thr Ser Ala Ser                        85 - #                 90 - #                 95               - - Thr Ala Tyr Met Glu Leu Ser Ser Leu Arg Se - #r Glu Asp Met Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:92:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 116 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:92:                               - - Met Lys His Leu Trp Phe Phe Leu Leu Leu Va - #l Ala Ala Pro Arg Trp         1               5 - #                 10 - #                 15               - - Val Leu Ser Gln Val Gln Leu Gln Glu Ser Gl - #y Pro Gly Leu Val Lys                    20     - #             25     - #             30                   - - Pro Ser Glu Thr Leu Ser Leu Thr Cys Thr Va - #l Ser Gly Gly Ser Ile                35         - #         40         - #         45                       - - Ser Ser Tyr Tyr Trp Ser Trp Ile Arg Gln Pr - #o Ala Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Ile Gly Arg Ile Tyr Thr Ser Gly Se - #r Thr Asn Tyr Asn Pro        65                 - # 70                 - # 75                 - # 80        - - Ser Leu Lys Ser Arg Val Thr Met Ser Val As - #p Thr Ser Lys Asn Gln                        85 - #                 90 - #                 95               - - Phe Ser Leu Lys Leu Ser Ser Val Thr Ala Al - #a Asp Thr Ala Val Tyr                   100      - #           105      - #           110                   - - Tyr Cys Ala Arg                                                                   115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:93:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 119 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:93:                               - - Met Asp Thr Leu Cys Ser Thr Leu Leu Leu Le - #u Thr Ile Pro Ser Trp         1               5 - #                 10 - #                 15               - - Val Leu Ser Gln Ile Thr Leu Lys Glu Ser Gl - #y Pro Thr Leu Val Lys                    20     - #             25     - #             30                   - - Pro Thr Gln Thr Leu Thr Leu Thr Cys Thr Ph - #e Ser Gly Phe Ser Leu                35         - #         40         - #         45                       - - Ser Thr Ser Gly Val Gly Val Gly Trp Ile Ar - #g Gln Pro Pro Gly Lys            50             - #     55             - #     60                           - - Ala Leu Glu Trp Leu Ala Leu Ile Tyr Trp As - #n Asp Asp Lys Arg Tyr        65                 - # 70                 - # 75                 - # 80        - - Ser Pro Ser Leu Lys Ser Arg Leu Thr Ile Th - #r Lys Asp Thr Ser Lys                        85 - #                 90 - #                 95               - - Asn Gln Val Val Leu Thr Met Thr Asn Met As - #p Pro Val Asp Thr Ala                   100      - #           105      - #           110                   - - Thr Tyr Tyr Cys Ala His Arg                                                       115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:94:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 112 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:94:                               - - Met Glu Leu Gly Leu Arg Trp Val Phe Leu Al - #a Ala Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Met Gln Leu Val Glu Ser Gl - #y Ala Asn Leu Thr Lys                    20     - #             25     - #             30                   - - Pro Gly Cys Pro Asp Ser Pro Val Gln Pro Le - #u Asp Ser Pro Ser Val                35         - #         40         - #         45                       - - Ala Ile Ala Arg Thr Gly Ser Pro Arg Leu Gl - #n Gly Arg Val Cys Ser            50             - #     55             - #     60                           - - Gly Ser Gln Leu Leu Val Val Val Val Val Pr - #o Cys Thr Thr Gln Thr        65                 - # 70                 - # 75                 - # 80        - - Leu Arg Ala Asp Ser Pro Phe Pro Glu Thr Il - #e Pro Lys Thr His Cys                        85 - #                 90 - #                 95               - - Ile Cys Lys Thr Asp Gly Gln Arg Met Gln Le - #u His Met Thr Leu Glu                   100      - #           105      - #           110                   - -  - - (2) INFORMATION FOR SEQ ID NO:95:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:95:                               - - Met Glu Leu Gly Leu Ser Trp Val Phe Leu Va - #l Ala Ile Leu Glu Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Gly Leu Val Gln                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Ser Tyr Trp Met Ser Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ala Asn Ile Lys Gln Asp Gly Se - #r Glu Lys Tyr Tyr Val        65                 - # 70                 - # 75                 - # 80        - - Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Ar - #g Asp Asn Ala Lys Asn                        85 - #                 90 - #                 95               - - Ser Leu Tyr Leu Gln Met Asn Ser Leu Arg Al - #a Glu Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:96:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:96:                               - - Met Asp Trp Thr Trp Arg Ile Leu Phe Leu Va - #l Ala Ala Ala Thr Ser         1               5 - #                 10 - #                 15               - - Ala His Ser Gln Val Gln Leu Val Gln Ser Gl - #y Ala Glu Val Lys Lys                    20     - #             25     - #             30                   - - Pro Gly Ala Ser Val Lys Val Ser Cys Lys Al - #a Ser Gly Tyr Thr Phe                35         - #         40         - #         45                       - - Thr Ser Tyr Asp Ile Asn Trp Val Arg Gln Al - #a Thr Gly Gln Gly Leu            50             - #     55             - #     60                           - - Glu Trp Met Gly Trp Met Asn Pro Asn Ser Gl - #y Asn Thr Gly Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Gln Lys Phe Gln Gly Arg Val Thr Met Thr Ar - #g Asn Thr Ser Ile Ser                        85 - #                 90 - #                 95               - - Thr Ala Tyr Met Glu Leu Ser Ser Leu Arg Se - #r Glu Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:97:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 118 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:97:                               - - Met Glu Leu Gly Leu Ser Trp Ile Phe Leu Le - #u Ala Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Gly Leu Val Gln                    20     - #             25     - #             30                   - - Pro Gly Arg Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Asp Asp Tyr Ala Met His Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ser Gly Ile Ser Trp Asn Ser Gl - #y Ser Ile Gly Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Ar - #g Asp Asn Ala Lys Asn                        85 - #                 90 - #                 95               - - Ser Leu Tyr Leu Gln Met Asn Ser Leu Arg Al - #a Glu Asp Thr Ala Leu                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Lys Asp                                                           115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:98:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 111 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:98:                               - - Met Glu Leu Tyr Ser Thr Leu Leu Leu Leu Th - #r Val Pro Ser Trp Val         1               5 - #                 10 - #                 15               - - Leu Ser Gln Val Thr Leu Lys Glu Ser Gly Pr - #o Ala Leu Val Lys Pro                    20     - #             25     - #             30                   - - Thr Gln Thr Leu Met Leu Thr Cys Thr Phe Se - #r Gly Phe Ser Leu Ser                35         - #         40         - #         45                       - - Thr Ser Gly Met Gly Val Gly Ile Cys Gln Pr - #o Ser Ala Lys Ala Leu            50             - #     55             - #     60                           - - Glu Trp Leu Ala His Ile Tyr Asn Asp Asn Ly - #s Tyr Tyr Ser Pro Ser        65                 - # 70                 - # 75                 - # 80        - - Leu Lys Ser Arg Leu Ile Ile Ser Lys Asp Th - #r Ser Lys Asn Glu Val                        85 - #                 90 - #                 95               - - Val Leu Thr Val Ile Asn Met Asp Ile Val As - #p Thr Ala Thr His                       100      - #           105      - #           110                   - -  - - (2) INFORMATION FOR SEQ ID NO:99:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:99:                               - - Met Glu Phe Gly Leu Ser Trp Val Phe Leu Va - #l Ala Ile Ile Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Gln Val Gln Leu Val Glu Ser Gl - #y Gly Gly Leu Val Lys                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Asp Tyr Tyr Met Ser Trp Ile Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ser Tyr Ile Ser Ser Ser Gly Se - #r Thr Ile Tyr Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Ar - #g Asp Asn Ala Lys Asn                        85 - #                 90 - #                 95               - - Ser Leu Tyr Leu Gln Met Asn Ser Leu Arg Al - #a Glu Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:100:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 144 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:100:                              - - Met Xaa Trp Thr Tyr Lys Ile Leu Phe Leu Va - #l Ala Ala Ala Thr Gly         1               5 - #                 10 - #                 15               - - Ala His Ser Gln Val Gln Leu Val Gln Ser Gl - #y Ala Glu Val Lys Lys                    20     - #             25     - #             30                   - - Pro Gly Ala Ser Val Lys Val Ser Cys Lys Al - #a Ser Gly Tyr Thr Phe                35         - #         40         - #         45                       - - Thr Tyr Cys Tyr Leu His Trp Val Gln Ala Pr - #o Gly Gln Gly Leu Glu            50             - #     55             - #     60                           - - Trp Thr Gly Phe Leu Phe Glu Arg Phe Phe Il - #e Gln His Leu Phe Cys        65                 - # 70                 - # 75                 - # 80        - - Lys Gln Ile Ser Gly Ile Val Glu Ile Ile Le - #u Thr Asn Leu Thr Gln                        85 - #                 90 - #                 95               - - Asn Phe Leu Ile Asn Leu Cys Lys His Gln Ph - #e Leu Asn Gln Cys Cys                   100      - #           105      - #           110                   - - Xaa Tyr Phe Arg Thr Gln Ala Gln Xaa His Il - #e Xaa Thr Leu Leu Xaa               115          - #       120          - #       125                       - - Ser Leu Phe Lys Xaa Tyr Gln Lys Xaa Ser Se - #r Xaa Ala Cys Asn Val           130              - #   135              - #   140                           - -  - - (2) INFORMATION FOR SEQ ID NO:101:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 116 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:101:                              - - Met Glu Leu Gly Leu Ser Trp Val Phe Leu Va - #l Ala Ile Leu Glu Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val His Leu Val Glu Ser Gl - #y Gly Gly Leu Val Gln                    20     - #             25     - #             30                   - - Pro Gly Gly Ala Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Asn Tyr Asp Met His Trp Val Arg Gln Al - #a Thr Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ser Ala Asn Gly Thr Ala Gly As - #p Thr Tyr Tyr Pro Gly        65                 - # 70                 - # 75                 - # 80        - - Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Gl - #u Asn Ala Lys Asn Ser                        85 - #                 90 - #                 95               - - Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Gl - #y Asp Thr Ala Val Tyr                   100      - #           105      - #           110                   - - Tyr Cys Ala Arg                                                                   115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:102:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 119 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:102:                              - - Met Glu Phe Gly Leu Ser Trp Ile Phe Leu Pr - #o Ala Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Ala Leu Val Lys                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Asn Ala Trp Met Ser Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Gly Arg Ile Lys Ser Lys Thr As - #p Gly Gly Thr Thr Asp        65                 - # 70                 - # 75                 - # 80        - - Tyr Ala Ala Pro Val Lys Gly Arg Phe Thr Il - #e Ser Arg Asp Asp Ser                        85 - #                 90 - #                 95               - - Lys Asn Thr Leu Tyr Leu Gln Met Asn Ser Le - #u Lys Thr Glu Asp Thr                   100      - #           105      - #           110                   - - Ala Val Tyr Tyr Cys Thr Thr                                                       115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:103:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:103:                              - - Met Glu Phe Gly Leu Ser Trp Val Phe Leu Al - #a Gly Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Gly Leu Val Gln                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Asn Ser Asp Met Asn Trp Ala Arg Lys Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ser Gly Val Ser Trp Asn Gly Se - #r Arg Thr His Tyr Val        65                 - # 70                 - # 75                 - # 80        - - Asp Ser Val Lys Arg Arg Phe Ile Ile Ser Ar - #g Asp Asn Ser Arg Asn                        85 - #                 90 - #                 95               - - Ser Leu Tyr Leu Gln Lys Asn Arg Arg Arg Al - #a Glu Asp Met Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Val Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:104:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 116 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:104:                              - - Met Asp Cys Thr Trp Gly Ile Leu Phe Leu Va - #l Ala Ser Xaa Thr Asp         1               5 - #                 10 - #                 15               - - Val His Ser Gln Val Gln Leu Leu Gln Pro Gl - #y Ala Glu Val Lys Lys                    20     - #             25     - #             30                   - - Pro Ala Ser Ser Val Lys Val Ser Trp Pro Gl - #y Phe Gln Ile His Leu                35         - #         40         - #         45                       - - His Gln Ile Leu Tyr Thr Val Gly Ala Thr Gl - #y Pro Trp Thr Arg Ala            50             - #     55             - #     60                           - - Trp Leu Gly Cys Ile Asn Pro Tyr Asn Asp As - #n Thr His Tyr Ala Gln        65                 - # 70                 - # 75                 - # 80        - - Lys Phe Arg Gly Arg Val Thr Ile Thr Ser As - #p Arg Ser Val Ser Thr                        85 - #                 90 - #                 95               - - Ala Tyr Met Glu Leu Ser Ser Leu Arg Ser Gl - #u Asp Met Val Val Tyr                   100      - #           105      - #           110                   - - Ser Cys Val Arg                                                                   115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:105:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:105:                              - - Met Asp Trp Thr Trp Ser Ile Leu Phe Leu Va - #l Ala Ala Pro Thr Gly         1               5 - #                 10 - #                 15               - - Ala His Ser Gln Val Gln Leu Val Gln Ser Gl - #y Ala Glu Val Lys Lys                    20     - #             25     - #             30                   - - Pro Gly Ala Ser Val Lys Val Ser Cys Lys Al - #a Ser Gly Tyr Thr Phe                35         - #         40         - #         45                       - - Thr Ser Tyr Gly Ile Ser Trp Val Arg Gln Al - #a Pro Gly Gln Gly Leu            50             - #     55             - #     60                           - - Glu Trp Met Gly Trp Ile Ser Ala Tyr Asn Gl - #y Asn Thr Asn Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Gln Lys Leu Gln Gly Arg Val Thr Met Thr Th - #r Asp Thr Ser Thr Ser                        85 - #                 90 - #                 95               - - Thr Ala Tyr Met Glu Leu Arg Ser Leu Arg Se - #r Asp Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:106:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:106:                              - - Met Glu Phe Gly Leu Ser Trp Val Phe Leu Va - #l Ala Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Gly Val Val Arg                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Asp Asp Tyr Gly Met Ser Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ser Gly Ile Asn Trp Asn Gly Gl - #y Ser Thr Gly Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Ar - #g Asp Asn Ala Lys Asn                        85 - #                 90 - #                 95               - - Ser Leu Tyr Leu Gln Met Asn Ser Leu Arg Al - #a Glu Asp Thr Ala Leu                   100      - #           105      - #           110                   - - Tyr His Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:107:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:107:                              - - Met Glu Leu Gly Leu Arg Trp Val Phe Leu Va - #l Ala Ile Leu Glu Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Gly Leu Val Lys                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Ser Tyr Ser Met Asn Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ser Ser Ile Ser Ser Ser Ser Se - #r Tyr Ile Tyr Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Ar - #g Asp Asn Ala Lys Asn                        85 - #                 90 - #                 95               - - Ser Leu Tyr Leu Gln Met Asn Ser Leu Arg Al - #a Glu Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:108:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 118 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:108:                              - - Met Glu Ser Trp Leu Ser Trp Val Phe Leu Al - #a Ala Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val His Leu Val Glu Ser Gl - #y Gly Ala Leu Val Gln                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Tyr Tyr Tyr Met Ser Gly Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Gly Phe Ile Arg Asn Lys Ala As - #n Gly Gly Thr Thr Glu        65                 - # 70                 - # 75                 - # 80        - - Thr Thr Ser Val Lys Gly Arg Phe Thr Ile Se - #r Arg Asp Asp Ser Lys                        85 - #                 90 - #                 95               - - Ser Ile Thr Tyr Leu Gln Met Lys Ser Leu Ly - #s Thr Glu Asp Thr Ala                   100      - #           105      - #           110                   - - Val Tyr Tyr Cys Ser Arg                                                           115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:109:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:109:                              - - Met Glu Phe Gly Leu Ser Trp Leu Phe Leu Va - #l Ala Lys Ile Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Leu Glu Ser Gl - #y Gly Gly Leu Val Gln                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Ser Tyr Ala Met Ser Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ser Ala Ile Ser Gly Ser Gly Gl - #y Ser Thr Tyr Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Ar - #g Asp Asn Ser Lys Asn                        85 - #                 90 - #                 95               - - Thr Leu Tyr Leu Gln Met Asn Ser Leu Arg Al - #a Glu Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Lys                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:110:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:110:                              - - Met Asp Cys Thr Trp Arg Ile Leu Phe Leu Va - #l Ala Ala Ala Thr Gly         1               5 - #                 10 - #                 15               - - Thr His Ala Gln Val Gln Leu Val Gln Ser Gl - #y Ala Glu Val Lys Lys                    20     - #             25     - #             30                   - - Pro Gly Ala Ser Val Lys Val Ser Cys Lys Va - #l Ser Gly Tyr Thr Leu                35         - #         40         - #         45                       - - Thr Glu Leu Ser Met His Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Met Gly Gly Phe Asp Pro Glu Asp Gl - #y Glu Thr Ile Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Gln Lys Phe Gln Gly Arg Val Thr Met Thr Gl - #u Asp Thr Ser Thr Asp                        85 - #                 90 - #                 95               - - Thr Ala Tyr Met Glu Leu Ser Ser Leu Arg Se - #r Glu Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Thr                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:111:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 111 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:111:                              - - Trp Ser Leu Cys Ala Gly Phe Ser Leu Leu Le - #u Phe Asn Val Ser Ser         1               5 - #                 10 - #                 15               - - Val Arg Cys Ser Trp Trp Ser Leu Gly Glu Al - #a Cys Lys Ser Leu Arg                    20     - #             25     - #             30                   - - Gly Pro Arg Asp Ser Pro Val Gln Pro Leu As - #n Ser Pro Ser Val Ala                35         - #         40         - #         45                       - - Thr Thr Thr Val Ser Ala Arg Leu Gln Gly Me - #t Gly Trp Ser Trp Phe            50             - #     55             - #     60                           - - Asp Lys Leu Ile Leu Met Gly Val Ala His Th - #r Ser Thr Pro Val Arg        65                 - # 70                 - # 75                 - # 80        - - Thr Asp Ser Ile Pro Pro Glu Ile Thr Pro Ar - #g Thr His Phe Ile Cys                        85 - #                 90 - #                 95               - - Lys Thr Ala Lys Pro Arg Thr Arg Pro Ser Il - #e Ser Val Pro Glu                       100      - #           105      - #           110                   - -  - - (2) INFORMATION FOR SEQ ID NO:112:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 119 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:112:                              - - Met Asp Thr Leu Cys Tyr Thr Leu Leu Leu Le - #u Thr Thr Pro Ser Trp         1               5 - #                 10 - #                 15               - - Val Leu Ser Gln Val Thr Leu Lys Glu Ser Gl - #y Pro Val Leu Val Lys                    20     - #             25     - #             30                   - - Pro Thr Glu Thr Leu Thr Leu Thr Cys Thr Va - #l Ser Gly Phe Ser Leu                35         - #         40         - #         45                       - - Ser Asn Ala Arg Met Gly Val Ser Trp Ile Ar - #g Gln Pro Pro Gly Lys            50             - #     55             - #     60                           - - Ala Leu Glu Trp Leu Ala His Ile Phe Ser As - #n Asp Glu Lys Ser Tyr        65                 - # 70                 - # 75                 - # 80        - - Ser Thr Ser Leu Lys Ser Arg Leu Thr Ile Se - #r Lys Asp Thr Ser Lys                        85 - #                 90 - #                 95               - - Ser Gln Val Val Leu Thr Met Thr Asn Met As - #p Pro Val Asp Thr Ala                   100      - #           105      - #           110                   - - Thr Tyr Tyr Cys Ala Arg Ile                                                       115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:113:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 112 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:113:                              - - Met Tyr Trp Thr Trp Arg Ile Leu Phe Leu Va - #l Ala Ala Ala Thr Gly         1               5 - #                 10 - #                 15               - - Val His Ser Gln Val Gln Leu Val Gln Ser Gl - #y Pro Glu Val Lys Lys                    20     - #             25     - #             30                   - - Pro Gly Ala Ser Leu Lys Val Ser Cys Lys Al - #a Ser Gly Tyr Thr Phe                35         - #         40         - #         45                       - - Thr Ser Tyr Ala Ile Ser Trp Val Gln Ala Hi - #s Gly Gln Gly Leu Glu            50             - #     55             - #     60                           - - Glu Met Gly Trp Ile Asn Thr Asn Thr Gly As - #n Leu Thr Tyr Ala Gln        65                 - # 70                 - # 75                 - # 80        - - Gly Phe Thr Gly Arg Phe Val Phe Ser Met As - #p Thr Ser Val Ser Met                        85 - #                 90 - #                 95               - - Ala Tyr Leu His Ile Ser Ser Leu Lys Ala Gl - #u Asp Thr Cys Lys Arg                   100      - #           105      - #           110                   - -  - - (2) INFORMATION FOR SEQ ID NO:114:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:114:                              - - Met Lys His Leu Trp Phe Phe Leu Leu Leu Va - #l Ala Ala Pro Arg Trp         1               5 - #                 10 - #                 15               - - Val Leu Ser Gln Val Gln Leu Gln Glu Ser Gl - #y Pro Gly Leu Val Lys                    20     - #             25     - #             30                   - - Pro Ser Asp Thr Leu Ser Leu Thr Cys Ala Va - #l Ser Gly Tyr Ser Ile                35         - #         40         - #         45                       - - Ser Ser Ser Asn Trp Trp Gly Trp Ile Arg Gl - #n Pro Pro Gly Lys Gly            50             - #     55             - #     60                           - - Leu Glu Trp Ile Gly Tyr Ile Tyr Tyr Ser Gl - #y Ser Thr Tyr Tyr Asn        65                 - # 70                 - # 75                 - # 80        - - Pro Ser Leu Lys Ser Arg Val Thr Met Ser Va - #l Asp Thr Ser Lys Asn                        85 - #                 90 - #                 95               - - Gln Phe Ser Leu Lys Leu Ser Ser Val Thr Al - #a Val Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:115:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:115:                              - - Met Glu Phe Gly Leu Ser Trp Val Phe Leu Va - #l Ala Leu Leu Arg Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Gln Val Gln Leu Val Glu Ser Gl - #y Gly Gly Val Val Gln                    20     - #             25     - #             30                   - - Pro Gly Arg Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Ser Tyr Gly Met His Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ala Val Ile Ser Tyr Asp Gly Se - #r Asn Lys Tyr Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Ar - #g Asp Asn Ser Lys Asn                        85 - #                 90 - #                 95               - - Thr Leu Tyr Leu Gln Met Asn Ser Leu Arg Al - #a Glu Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:116:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 118 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:116:                              - - Met Lys His Leu Trp Phe Phe Leu Leu Leu Va - #l Ala Ala Pro Arg Trp         1               5 - #                 10 - #                 15               - - Val Leu Pro Gln Val Gln Leu Gln Glu Ser Gl - #y Pro Gly Leu Val Lys                    20     - #             25     - #             30                   - - Pro Ser Gln Thr Leu Ser Leu Thr Cys Thr Va - #l Ser Gly Gly Ser Ile                35         - #         40         - #         45                       - - Ser Ser Gly Gly Tyr Tyr Trp Ser Trp Ile Ar - #g Gln His Pro Gly Lys            50             - #     55             - #     60                           - - Gly Leu Glu Trp Ile Gly Tyr Ile Tyr Tyr Se - #r Gly Ser Thr Tyr Tyr        65                 - # 70                 - # 75                 - # 80        - - Asn Pro Ser Leu Lys Ser Arg Val Thr Ile Se - #r Val Asp Thr Ser Lys                        85 - #                 90 - #                 95               - - Asn Gln Phe Ser Leu Lys Leu Ser Ser Val Th - #r Ala Ala Asp Thr Ala                   100      - #           105      - #           110                   - - Val Tyr Tyr Cys Ala Arg                                                           115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:117:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:117:                              - - Met Glu Phe Gly Leu Ser Trp Val Phe Leu Va - #l Ala Leu Leu Arg Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Gln Val Gln Leu Val Glu Ser Gl - #y Gly Gly Val Val Gln                    20     - #             25     - #             30                   - - Pro Gly Arg Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Ser Tyr Gly Met His Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ala Val Ile Trp Tyr Asp Gly Se - #r Asn Lys Tyr Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Asp Ser Ala Lys Gly Arg Phe Thr Ile Ser Ar - #g Asp Asn Ser Thr Asn                        85 - #                 90 - #                 95               - - Thr Leu Phe Leu Gln Met Asn Ser Leu Arg Al - #a Glu Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:118:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 116 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:118:                              - - Met Lys His Leu Trp Phe Phe Leu Leu Leu Va - #l Ala Ala Pro Arg Trp         1               5 - #                 10 - #                 15               - - Val Leu Ser Gln Val Gln Leu Gln Gln Trp Gl - #y Ala Gly Leu Leu Lys                    20     - #             25     - #             30                   - - Pro Ser Glu Thr Leu Ser Leu Thr Cys Ala Va - #l Tyr Gly Gly Ser Phe                35         - #         40         - #         45                       - - Ser Gly Tyr Tyr Trp Ser Trp Ile Arg Gln Pr - #o Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Ile Gly Glu Ile Asn His Ser Gly Se - #r Thr Asn Tyr Asn Pro        65                 - # 70                 - # 75                 - # 80        - - Ser Leu Lys Ser Arg Val Thr Ile Ser Val As - #p Thr Ser Lys Asn Gln                        85 - #                 90 - #                 95               - - Phe Ser Leu Lys Leu Ser Ser Val Thr Ala Al - #a Asp Thr Ala Val Tyr                   100      - #           105      - #           110                   - - Tyr Cys Ala Arg                                                                   115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:119:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:119:                              - - Met Glu Phe Gly Leu Ser Trp Val Phe Leu Al - #a Ala Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Gly Leu Val Gln                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Asn Ser Asp Met Asn Trp Val His Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ser Gly Val Ser Trp Asn Gly Se - #r Arg Thr His Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Asp Ser Val Lys Gly Arg Phe Ile Ile Ser Ar - #g Asp Asn Ser Arg Asn                        85 - #                 90 - #                 95               - - Thr Leu Tyr Leu Gln Thr Asn Ser Leu Arg Al - #a Glu Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Val Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:120:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 112 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:120:                              - - Met Glu Phe Gly Leu Ser Trp Gly Phe His Va - #l Ala Asn Val Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val His Leu Val Glu Ser Le - #u Gly Gly Leu Leu Pro                    20     - #             25     - #             30                   - - Gly Gly Pro Asp Phe Leu Leu Gln Pro Leu As - #p Ser Pro Leu Val Pro                35         - #         40         - #         45                       - - Leu Leu Gly Thr Gly Ala Gly Ser Ile Arg Le - #u Leu Gly Lys Gly Trp            50             - #     55             - #     60                           - - Ser Arg Ser His Leu Val Val Val Val Ala Gl - #n Ala Met Gln Thr Leu        65                 - # 70                 - # 75                 - # 80        - - Arg Val Asp Ser Pro Ser Pro Glu Met Met Pr - #o Arg Asn His Cys Ile                        85 - #                 90 - #                 95               - - Cys Lys Thr Ala Ser Glu Pro Arg Ile Gly Le - #u Cys Ile Thr Val Val                   100      - #           105      - #           110                   - -  - - (2) INFORMATION FOR SEQ ID NO:121:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 111 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:121:                              - - Met Leu Phe Gly Leu Ser Trp Pro Phe Arg Ph - #e Thr Ile Leu Arg Gly         1               5 - #                 10 - #                 15               - - Val Gln Tyr Glu Val Gln Leu Val Glu Ser Gl - #y Gly Asp Leu Val Gln                    20     - #             25     - #             30                   - - Leu Trp Trp Val Leu Arg Leu Ser Cys Ala Al - #a Cys Gly Phe Ile Leu                35         - #         40         - #         45                       - - Arg Ser Asn Trp Ser His Arg Ala Ser Arg Ly - #s Gly Leu Ala Trp Asn            50             - #     55             - #     60                           - - Asp Met Val Ser Tyr Ile Ser Ala Ser Gly Gl - #y Ser Leu Tyr Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Asp Thr Glu Gly Ile His His Leu Arg Gln Tr - #p Gln Glu His Ala Val                        85 - #                 90 - #                 95               - - Leu Ala Asn Glu Gln Ser Glu Arg Gly Leu Gl - #y Cys Val Glu Arg                       100      - #           105      - #           110                   - -  - - (2) INFORMATION FOR SEQ ID NO:122:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 115 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:122:                              - - Met Gln Phe Val Leu Ser Trp Val Phe Leu Va - #l Gly Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Gly Leu Val Gln                    20     - #             25     - #             30                   - - Pro Arg Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Val                35         - #         40         - #         45                       - - Ser Ser Asn Glu Met Ser Trp Ile Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ser Ser Ile Ser Gly Gly Ser Th - #r Tyr Tyr Ala Asp Ser        65                 - # 70                 - # 75                 - # 80        - - Arg Lys Gly Arg Phe Thr Ile Ser Arg Asp As - #n Ser Lys Asn Thr Leu                        85 - #                 90 - #                 95               - - Tyr Leu Gln Met Asn Asn Leu Arg Ala Glu Gl - #y Thr Ala Ala Tyr Tyr                   100      - #           105      - #           110                   - - Cys Ala Arg                                                                       115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:123:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 118 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:123:                              - - Met Lys His Leu Trp Phe Phe Leu Leu Leu Va - #l Ala Ala Pro Arg Trp         1               5 - #                 10 - #                 15               - - Val Leu Ser Gln Leu Gln Leu Gln Glu Ser Gl - #y Pro Gly Leu Val Lys                    20     - #             25     - #             30                   - - Pro Ser Glu Thr Leu Ser Leu Thr Cys Thr Va - #l Ser Gly Gly Ser Ile                35         - #         40         - #         45                       - - Ser Ser Ser Ser Tyr Tyr Trp Gly Trp Ile Ar - #g Gln Pro Pro Gly Lys            50             - #     55             - #     60                           - - Gly Leu Glu Trp Ile Gly Ser Ile Tyr Tyr Se - #r Gly Ser Thr Tyr Tyr        65                 - # 70                 - # 75                 - # 80        - - Asn Pro Ser Leu Lys Ser Arg Val Thr Ile Se - #r Val Asp Thr Ser Lys                        85 - #                 90 - #                 95               - - Asn Gln Phe Ser Leu Lys Leu Ser Ser Val Th - #r Ala Ala Asp Thr Ala                   100      - #           105      - #           110                   - - Val Tyr Tyr Cys Ala Arg                                                           115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:124:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 114 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:124:                              - - Met Glu Ser Trp Leu Ser Trp Val Phe Leu Al - #a Ala Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Gly Leu Val Gln                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Ser Phe                35         - #         40         - #         45                       - - Ser Ser Tyr Gly Met Ser Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Val Ala His Ile Trp Asn Asp Gly Ser Gl - #n Lys Tyr Tyr Ala Asp        65                 - # 70                 - # 75                 - # 80        - - Ser Val Lys Gly Arg Phe Thr Ile Ser Glu Th - #r Ile Leu Arg Ala Cys                        85 - #                 90 - #                 95               - - Ser Ile Cys Lys Trp Thr Val Lys Leu Arg Th - #r Arg Pro Cys Ile Thr                   100      - #           105      - #           110                   - - Val Pro                                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:125:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 118 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:125:                              - - Met Glu Phe Gly Leu Ser Trp Val Phe Leu Va - #l Ala Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Val Val Val Gln                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Asp Asp Tyr Thr Met His Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ser Leu Ile Ser Trp Asp Gly Gl - #y Ser Thr Tyr Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Ar - #g Asp Asn Ser Lys Asn                        85 - #                 90 - #                 95               - - Ser Leu Tyr Leu Gln Met Asn Ser Leu Arg Th - #r Glu Asp Thr Ala Leu                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Lys Asp                                                           115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:126:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 115 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:126:                              - - Met Leu Phe Gly Leu Ser Trp Ala Phe Leu Va - #l Thr Ile Leu Arg Gly         1               5 - #                 10 - #                 15               - - Val Gln Tyr Glu Val Gln Leu Val Glu Ser Ph - #e Phe Phe Phe Phe Phe                    20     - #             25     - #             30                   - - His Phe Leu Ala Asn Ile His Gly Leu Gln As - #n Asn Gly Leu Ala Phe                35         - #         40         - #         45                       - - Leu Pro Thr Leu Tyr Arg His His Gln Phe Se - #r Pro Cys Leu Gly Phe            50             - #     55             - #     60                           - - Pro Glu Glu Cys Cys His His Leu Ser Cys Se - #r Phe Arg Lys Asn Ala        65                 - # 70                 - # 75                 - # 80        - - Pro Ser Thr His Leu His Leu Ser Ala Cys Il - #e Ser Ile Cys Leu Gly                        85 - #                 90 - #                 95               - - Arg Ser Gln Gln Pro Xaa Glu His Ser Pro Hi - #s Pro Thr Met Leu Leu                   100      - #           105      - #           110                   - - Glu Gly Val                                                                       115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:127:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:127:                              - - Met Asp Trp Thr Trp Arg Ile Leu Phe Leu Va - #l Ala Ala Ala Thr Asp         1               5 - #                 10 - #                 15               - - Ala Tyr Ser Gln Met Gln Leu Val Gln Ser Gl - #y Ala Glu Val Lys Lys                    20     - #             25     - #             30                   - - Thr Gly Ser Ser Val Lys Val Ser Cys Lys Al - #a Ser Gly Tyr Thr Phe                35         - #         40         - #         45                       - - Thr Tyr Arg Tyr Leu His Trp Val Arg Gln Al - #a Pro Gly Gln Ala Leu            50             - #     55             - #     60                           - - Glu Trp Met Gly Trp Ile Thr Pro Phe Asn Gl - #y Asn Thr Asn Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Gln Lys Phe Gln Asp Arg Val Thr Ile Thr Ar - #g Asp Arg Ser Met Ser                        85 - #                 90 - #                 95               - - Thr Ala Tyr Met Glu Leu Ser Ser Leu Arg Se - #r Glu Asp Thr Ala Met                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:128:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:128:                              - - Met Asp Trp Thr Trp Arg Val Phe Cys Leu Le - #u Ala Val Ala Pro Gly         1               5 - #                 10 - #                 15               - - Ala His Ser Gln Val Gln Leu Val Gln Ser Gl - #y Ala Glu Val Lys Lys                    20     - #             25     - #             30                   - - Pro Gly Ala Ser Val Lys Val Ser Cys Lys Al - #a Ser Gly Tyr Thr Phe                35         - #         40         - #         45                       - - Thr Ser Tyr Tyr Met His Trp Val Arg Gln Al - #a Pro Gly Gln Gly Leu            50             - #     55             - #     60                           - - Glu Trp Met Gly Ile Ile Asn Pro Ser Gly Gl - #y Ser Thr Ser Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Gln Lys Phe Gln Gly Arg Val Thr Met Thr Ar - #g Asp Thr Ser Thr Ser                        85 - #                 90 - #                 95               - - Thr Val Tyr Met Glu Leu Ser Ser Leu Arg Se - #r Glu Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:129:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 110 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:129:                              - - Arg Glu Phe Val Leu Ser Trp Val Phe Leu Va - #l Ala Ile Leu Lys Cys         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Asp Gln Leu Val Glu Ser Gl - #y Gly Gly Leu Val Gln                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Pro Ser Cys Ala Al - #a Ser Gly Phe Ala Phe                35         - #         40         - #         45                       - - Ser Ser Tyr Val Leu His Trp Val Arg Arg Al - #a Pro Gly Lys Gly Pro            50             - #     55             - #     60                           - - Glu Trp Val Ser Ala Ile Gly Thr Gly Gly As - #p Thr Tyr Tyr Ala Asp        65                 - # 70                 - # 75                 - # 80        - - Ser Val Met Gly Arg Phe Thr Ile Ser Arg As - #p Asn Ala Lys Lys Ser                        85 - #                 90 - #                 95               - - Leu Tyr Leu Lys Thr Ala Leu Arg Thr Trp Le - #u Cys Ile Met                           100      - #           105      - #           110                   - -  - - 2) INFORMATION FOR SEQ ID NO:130:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:130:                              - - Met Glu Leu Gly Leu Cys Trp Val Phe Leu Va - #l Ala Ile Leu Glu Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Gly Leu Val Gln                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Ser Tyr Ser Met Asn Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ser Tyr Ile Ser Ser Ser Ser Se - #r Thr Ile Tyr Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Ar - #g Asp Asn Ala Lys Asn                        85 - #                 90 - #                 95               - - Ser Leu Tyr Leu Gln Met Asn Ser Leu Arg Al - #a Glu Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:131:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 119 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:131:                              - - Met Glu Phe Gly Leu Ser Trp Val Phe Leu Va - #l Ala Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Gly Leu Val Gln                    20     - #             25     - #             30                   - - Pro Gly Arg Ser Leu Arg Leu Ser Cys Thr Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Gly Asp Tyr Ala Met Ser Trp Phe Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Gly Phe Ile Arg Ser Lys Ala Ty - #r Gly Gly Thr Thr Glu        65                 - # 70                 - # 75                 - # 80        - - Tyr Thr Ala Ser Val Lys Gly Arg Phe Thr Il - #e Ser Arg Asp Gly Ser                        85 - #                 90 - #                 95               - - Lys Ser Ile Ala Tyr Leu Gln Met Asn Ser Le - #u Lys Thr Glu Asp Thr                   100      - #           105      - #           110                   - - Ala Val Tyr Tyr Cys Thr Arg                                                       115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:132:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 55 amino - #acids                                                  (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:132:                              - - Met Glu Leu Gly Leu Ser Trp Val Ser Leu Va - #l Ile Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Cys Gly Val Gln Met Val Glu Ser Trp Gl - #y Glu Leu Ala Gln Xaa                    20     - #             25     - #             30                   - - Glu Cys Ala Asp Ser Ala Val His Pro Leu As - #n Pro Pro Ser Val Ala                35         - #         40         - #         45                       - - Thr Arg Ser Ala Glu Ser Ala                                                    50             - #     55                                                  - -  - - 2) INFORMATION FOR SEQ ID NO:133:                                     - -     (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino ac - #ids                                                (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                   - -    (ii) MOLECULE TYPE: protein                                             - -    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:133:                               - - Met Gly Ser Thr Ala Ile Leu Ala Leu Leu Le - #u Ala Val Leu Gln Gly         1               5 - #                 10 - #                 15               - - Val Cys Ser Glu Val Gln Leu Val Gln Ser Gl - #y Ala Glu Val Lys Lys                    20     - #             25     - #             30                   - - Pro Gly Glu Ser Leu Lys Ile Ser Cys Lys Gl - #y Ser Gly Tyr Ser Phe                35         - #         40         - #         45                       - - Thr Ser Tyr Trp Ile Gly Trp Val Arg Gln Me - #t Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Met Gly Ile Ile Tyr Pro Gly Asp Se - #r Asp Thr Arg Tyr Ser        65                 - # 70                 - # 75                 - # 80        - - Pro Ser Phe Gln Gly Gln Val Thr Ile Ser Al - #a Asp Lys Ser Ile Ser                        85 - #                 90 - #                 95               - - Thr Ala Tyr Leu Gln Trp Ser Ser Leu Lys Al - #a Ser Asp Thr Ala Met                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Arg                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:134:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 116 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:134:                              - - Met Glu Ser Gly Leu Ser Trp Val Phe Leu Va - #l Ala Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Leu Val Gln Pro                    20     - #             25     - #             30                   - - Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Se - #r Gly Phe Thr Phe Ser                35         - #         40         - #         45                       - - Ser Ser Trp Met His Trp Val Cys Gln Ala Pr - #o Glu Lys Gly Leu Glu            50             - #     55             - #     60                           - - Trp Val Ala Asp Ile Lys Cys Asp Gly Ser Gl - #u Lys Tyr Tyr Val Asp        65                 - # 70                 - # 75                 - # 80        - - Ser Val Lys Gly Arg Leu Thr Ile Ser Arg As - #p Asn Ala Lys Asn Ser                        85 - #                 90 - #                 95               - - Leu Tyr Leu Gln Val Asn Ser Leu Arg Ala Gl - #u Asp Met Thr Val Tyr                   100      - #           105      - #           110                   - - Tyr Cys Val Arg                                                                   115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:135:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 116 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:135:                              - - Met Glu Phe Trp Leu Ser Trp Val Phe Leu Va - #l Ala Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Gly Gly Leu Ile Gln                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Val                35         - #         40         - #         45                       - - Ser Ser Asn Tyr Met Ser Trp Val Arg Gln Al - #a Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Val Ser Val Ile Tyr Ser Gly Gly Se - #r Thr Tyr Tyr Ala Asp        65                 - # 70                 - # 75                 - # 80        - - Ser Val Lys Gly Arg Phe Thr Ile Ser Arg As - #p Asn Ser Lys Asn Thr                        85 - #                 90 - #                 95               - - Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Gl - #u Asp Thr Ala Val Tyr                   100      - #           105      - #           110                   - - Tyr Cys Ala Arg                                                                   115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:136:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 112 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:136:                              - - Met Glu Leu Gly Leu Ser Trp Val Phe Thr Va - #l Thr Val Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #u Glu Asn Gln Arg Gln                    20     - #             25     - #             30                   - - Leu Gly Gly Ser Leu Arg Leu Ser Cys Ala As - #p Ser Gly Leu Thr Phe                35         - #         40         - #         45                       - - Ser Ser Tyr Met Ser Ser Asp Ser Gln Ala Pr - #o Gly Lys Gly Leu Glu            50             - #     55             - #     60                           - - Val Val Asp Ile Asp Arg Ser Gln Leu Cys Ty - #r Ala Gln Ser Val Lys        65                 - # 70                 - # 75                 - # 80        - - Ser Arg Phe Thr Ile Ser Lys Glu Asn Ala Ly - #s Asn Ser Leu Cys Leu                        85 - #                 90 - #                 95               - - Gln Met Asn Ser Leu Arg Ala Glu Gly Thr Al - #a Val Tyr Tyr Cys Met                   100      - #           105      - #           110                   - -  - - 2) INFORMATION FOR SEQ ID NO:137:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 120 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:137:                              - - Met Lys His Leu Trp Phe Phe Leu Leu Leu Va - #l Ala Ala Pro Arg Trp         1               5 - #                 10 - #                 15               - - Val Leu Ser Gln Val Gln Leu Gln Glu Ser Gl - #y Pro Gly Leu Val Lys                    20     - #             25     - #             30                   - - Pro Ser Glu Thr Leu Ser Leu Ile Cys Ala Va - #l Ser Gly Asp Ser Ile                35         - #         40         - #         45                       - - Ser Ser Gly Asn Trp Ile Trp Val Arg Gln Pr - #o Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Ile Gly Glu Ile His His Ser Gly Se - #r Thr Tyr Tyr Asn Pro        65                 - # 70                 - # 75                 - # 80        - - Ser Leu Lys Ser Arg Ile Thr Met Ser Val As - #p Thr Ser Lys Asn Gln                        85 - #                 90 - #                 95               - - Phe Tyr Leu Lys Leu Ser Ser Val Thr Ala Al - #a Asp Thr Ala Val Tyr                   100      - #           105      - #           110                   - - Tyr Cys Ala Arg Tyr Thr Val Arg                                                   115          - #       120                                              - -  - - (2) INFORMATION FOR SEQ ID NO:138:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 69 amino - #acids                                                  (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:138:                              - - Met Asp Trp Thr Trp Ser Ile Leu Phe Leu Va - #l Ala Ala Ala Thr Gly         1               5 - #                 10 - #                 15               - - Ala His Ser Arg Val Gln Leu Val Gln Ser Gl - #y Pro Glu Val Lys Gln                    20     - #             25     - #             30                   - - Pro Gly Ala Ser Ala Lys Val Ser Cys Lys Va - #l Ser Gly Thr Val Ile                35         - #         40         - #         45                       - - Thr Tyr Gly Met Asn Trp Ile Arg Gln Thr Pr - #o Gly Gln Gly Leu Glu            50             - #     55             - #     60                           - - Trp Met Gly Trp Ile                                                        65                                                                             - -  - - (2) INFORMATION FOR SEQ ID NO:139:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 117 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:139:                              - - Met Asp Trp Ile Trp Arg Ile Leu Phe Leu Va - #l Gly Ala Ala Ile Gly         1               5 - #                 10 - #                 15               - - Ala His Ser Gln Met Gln Leu Val Gln Ser Gl - #y Pro Glu Val Lys Lys                    20     - #             25     - #             30                   - - Pro Gly Thr Ser Val Lys Val Ser Cys Lys Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Thr Ser Ser Ala Val Gln Trp Val Arg Gln Al - #a Arg Gly Gln Arg Leu            50             - #     55             - #     60                           - - Glu Trp Ile Gly Trp Ile Val Val Gly Ser Gl - #y Asn Thr Asn Tyr Ala        65                 - # 70                 - # 75                 - # 80        - - Gln Lys Phe Gln Glu Arg Val Thr Ile Thr Ar - #g Asp Met Ser Thr Ser                        85 - #                 90 - #                 95               - - Thr Ala Tyr Met Glu Leu Ser Ser Leu Arg Se - #r Glu Asp Thr Ala Val                   100      - #           105      - #           110                   - - Tyr Tyr Cys Ala Ala                                                               115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:140:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 116 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:140:                              - - Met Lys His Leu Trp Phe Phe Leu Leu Leu Va - #l Ala Ala Pro Arg Trp         1               5 - #                 10 - #                 15               - - Val Leu Ser Gln Val Gln Leu Gln Glu Ser Gl - #y Pro Gly Leu Val Lys                    20     - #             25     - #             30                   - - Pro Ser Glu Thr Leu Ser Leu Thr Cys Thr Va - #l Ser Gly Gly Ser Val                35         - #         40         - #         45                       - - Ser Ser Tyr Tyr Trp Ser Trp Ile Arg Gln Pr - #o Pro Gly Lys Gly Leu            50             - #     55             - #     60                           - - Glu Trp Ile Gly Tyr Ile Tyr Tyr Ser Gly Se - #r Thr Asn Tyr Asn Pro        65                 - # 70                 - # 75                 - # 80        - - Ser Leu Lys Ser Arg Val Thr Ile Ser Val As - #p Thr Ser Lys Asn Gln                        85 - #                 90 - #                 95               - - Phe Ser Leu Lys Leu Ser Ser Val Thr Ala Al - #a Asp Thr Ala Val Tyr                   100      - #           105      - #           110                   - - Tyr Cys Ala Arg                                                                   115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:141:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 113 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:141:                              - - Met Glu Ser Gly Leu Ser Trp Ile Phe Leu Va - #l Ala Val Leu Lys Gly         1               5 - #                 10 - #                 15               - - Cys Pro Val Gly Ala Ala Gly Gly Val Trp Gl - #y Arg Leu Ser Lys Asp                    20     - #             25     - #             30                   - - Trp Gly Val Ser Glu Thr Leu Leu Cys Ser Le - #u Trp Ile His Leu Gln                35         - #         40         - #         45                       - - Leu Cys Tyr Ala Leu Gly Pro Pro Gly Ser Ar - #g Lys Gly Phe Gly Val            50             - #     55             - #     60                           - - Gly Leu Ser Tyr Tyr Lys Trp Tyr Arg Thr Le - #u His Arg Leu Cys Glu        65                 - # 70                 - # 75                 - # 80        - - Gly Leu Ile His His Leu Arg Gln Cys Pro Gl - #u Phe Thr Val Ser Ala                        85 - #                 90 - #                 95               - - Asn Glu Gln Pro Glu Ser Arg Arg His Gly Cy - #s Val Leu Leu Cys Glu                   100      - #           105      - #           110                   - - Arg                                                                        - -  - - (2) INFORMATION FOR SEQ ID NO:142:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 118 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:142:                              - - Met Lys His Leu Trp Phe Phe Leu Leu Leu Va - #l Ala Ala Pro Arg Trp         1               5 - #                 10 - #                 15               - - Val Leu Ser Gln Val Gln Leu Gln Glu Ser Gl - #y Pro Gly Leu Val Lys                    20     - #             25     - #             30                   - - Pro Ser Glu Thr Leu Ser Leu Thr Cys Thr Va - #l Ser Gly Gly Ser Val                35         - #         40         - #         45                       - - Ser Ser Gly Ser Tyr Tyr Trp Ser Trp Ile Ar - #g Gln Pro Pro Gly Lys            50             - #     55             - #     60                           - - Gly Leu Glu Trp Ile Gly Tyr Ile Tyr Tyr Se - #r Gly Ser Thr Asn Tyr        65                 - # 70                 - # 75                 - # 80        - - Asn Pro Ser Leu Lys Ser Arg Val Thr Ile Se - #r Val Asp Thr Ser Lys                        85 - #                 90 - #                 95               - - Asn Gln Phe Ser Leu Lys Leu Ser Ser Val Th - #r Ala Ala Asp Thr Ala                   100      - #           105      - #           110                   - - Val Tyr Tyr Cys Ala Arg                                                           115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:143:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 116 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:143:                              - - Met Glu Phe Gly Leu Ser Trp Val Phe Leu Va - #l Ala Asn Leu Arg Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Gln Leu Val Glu Ser Gl - #y Glu Gly Leu Val Gln                    20     - #             25     - #             30                   - - Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Ser Ser Ala Met His Trp Val Arg Gln Al - #a Pro Arg Lys Gly Leu            50             - #     55             - #     60                           - - Trp Val Ser Val Ile Ser Thr Ser Gly Asp Th - #r Val Leu Tyr Thr Asp        65                 - # 70                 - # 75                 - # 80        - - Ser Val Lys Gly Arg Phe Thr Ile Ser Arg As - #p Asn Ala Gln Asn Ser                        85 - #                 90 - #                 95               - - Leu Ser Leu Gln Met Asn Ser Leu Arg Ala Gl - #u Gly Thr Val Val Tyr                   100      - #           105      - #           110                   - - Tyr Cys Val Lys                                                                   115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:144:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 115 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:144:                              - - Met Gly Phe Glu Leu Thr Arg Ile Phe Leu Va - #l Ala Ile Leu Lys Gly         1               5 - #                 10 - #                 15               - - Val Gln Cys Glu Val Glu Leu Ile Glu Ser Il - #e Glu Gly Leu Arg Gln                    20     - #             25     - #             30                   - - Leu Gly Lys Phe Leu Arg Leu Ser Cys Val Al - #a Ser Gly Phe Thr Phe                35         - #         40         - #         45                       - - Ser Ser Tyr Met Ser Trp Val Asn Glu Thr Le - #u Gly Lys Gly Leu Glu            50             - #     55             - #     60                           - - Gly Val Ile Asp Val Lys Tyr Asp Gly Ser Gl - #n Ile Tyr His Ala Asp        65                 - # 70                 - # 75                 - # 80        - - Ser Val Lys Gly Arg Phe Thr Ile Ser Lys As - #p Asn Ala Lys Asn Ser                        85 - #                 90 - #                 95               - - Pro Tyr Leu Gln Thr Asn Ser Leu Arg Ala Gl - #u Asp Met Thr Met His                   100      - #           105      - #           110                   - - Gly Cys Thr                                                                       115                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:145:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 118 amino - #acids                                                 (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                  - -     (ii) MOLECULE TYPE: protein                                            - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:145:                              - - Met Thr Glu Phe Gly Leu Ser Trp Val Phe Le - #u Val Ala Ile Phe Lys         1               5 - #                 10 - #                 15               - - Gly Val Gln Cys Glu Val Gln Leu Val Glu Se - #r Gly Gly Gly Leu Val                    20     - #             25     - #             30                   - - Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Al - #a Ala Ser Gly Phe Thr                35         - #         40         - #         45                       - - Phe Ser Ser Tyr Ala Met His Trp Val Arg Gl - #n Ala Pro Gly Lys Gly            50             - #     55             - #     60                           - - Leu Glu Tyr Val Ser Ala Ile Ser Ser Asn Gl - #y Gly Ser Thr Tyr Tyr        65                 - # 70                 - # 75                 - # 80        - - Ala Asn Ser Val Lys Gly Arg Phe Thr Ile Se - #r Arg Asp Asn Ser Lys                        85 - #                 90 - #                 95               - - Asn Thr Leu Tyr Leu Gln Met Gly Ser Leu Ar - #g Ala Glu Asp Met Ala                   100      - #           105      - #           110                   - - Val Tyr Tyr Cys Ala Arg                                                           115                                                                   __________________________________________________________________________ 

We claim:
 1. An isolated polynucleotide comprising the following nucleic acid sequences of (a) through (f) in 5' to 3' order:(a) a nucleic acid sequence of a portion of a human genome isolable from:(i) a yeast artificial chromosome clone Y6 which is isolable from a transformant identified by an international deposit number FERM BP-4271; or (ii) a yeast artificial chromosome clone Y24 which is isolable from a transformant identified by an international deposit number FERM BP-4274; (b) a nucleic acid sequence of a portion of a human genome isolable from a yeast artificial chromosome clone Y21 which is isolable from a transformant identified by an international deposit number FERM BP-4273, wherein said nucleic acid sequence lacks the 5' terminal sequence of said portion of a human genome so isolable from the clone Y21 that duplicates the 3' terminal sequence of the nucleic acid sequence of (a); (c) a nucleic acid sequence of a portion of a human genome isolable from a cosmid vector clone M118 which is isolable from a transformant identified by an international deposit number FERM BP-4278, wherein said nucleic acid sequence lacks the 5' terminal sequence of said portion of a human genome so isolable from the clone M118 that duplicates the 3' terminal sequence of the nucleic acid sequence of (b); (d) a nucleic acid sequence of a portion of a human genome isolable from a cosmid vector clone M84 which is isolable from a transformant identified by an international deposit number FERM BP-4277, wherein said nucleic acid sequence lacks the 5' terminal sequence of said portion of a human genome so isolable from the clone M84 that duplicates the 3' terminal sequence of the nucleic acid sequence of (c); (e) a nucleic acid sequence of a portion of a human genome isolable from a cosmid vector clone M131 which is isolable from a transformant identified by an international deposit number FERM BP-4279, wherein said nucleic acid sequence lacks the 5' terminal sequence of said portion of a human genome so isolable from the clone M131 that duplicates the 3' terminal sequence of the nucleic acid sequence of (d); and (f) a nucleic acid sequence of a portion of a human genome isolable from a cosmid vector clone 3-31 which is isolable from a transformant identified by an international deposit number FERM BP-4276, wherein said nucleic acid sequence lacks the 5' terminal sequence of said portion of a human genome so isolable from the clone 3-31 that duplicates the 3' terminal sequence of the nucleic acid sequence of (e), wherein each of said portions of a human genome which is so isolable from the clone Y6, Y24, Y21, M118, M84, M131 and 3-31 respectively is in a relative position in a human genome as shown in FIG.
 1. 2. The polynucleotide of claim 1 wherein the polynucleotide has the restriction pattern and organization shown in FIG.
 1. 3. The polynucleotide of claim 1 wherein the nucleic acid sequence of a portion of a human genome isolable from the clone Y6 comprises the nucleic acid sequences of SEQ ID NOS: 32 through 64;the nucleic acid sequence of a portion of a human genome isolable from the clone Y24 comprises the nucleic acid sequences of SEQ ID NOS: 32 through 64; the nucleic acid sequence of a portion of a human genome isolable from the clone Y21 comprises the nucleic acid sequences of SEQ ID NOS: 15 through 34; the nucleic acid sequence of a portion of a human genome isolable from the clone M118 comprises the nucleic acid sequences of SEQ ID NOS: 14 and 15; the nucleic acid sequence of a portion of a human genome isolable from the clone M84 comprises the nucleic acid sequences of SEQ ID NOS: 9 through 13; the nucleic acid sequence of a portion of a human genome isolable from the clone M131 comprises the nucleic acid sequences of SEQ ID NOS: 8 and 9; and the nucleic acid sequence of a portion of a human genome isolable from the clone 3-31 comprises the nucleic acid sequences of SEQ ID NOS: 6 through
 8. 4. An isolated polynucleotide comprising the following nucleic acid sequences of (a) and (b) in 5' to 3' order:(a) a nucleic acid sequence of a portion of a human genome isolable from a yeast artificial chromosome clone Y103 which is isolable from a transformant identified by an international deposit number FERM BP-4275; and (b) a nucleic acid sequence of a portion of a human genome isolable from a yeast artificial chromosome clone Y20 which is isolable from a transformant identified by an international deposit number FERMBP-4272, wherein said nucleic acid sequence lacks the 5' terminal sequence of said portion of a human genome so isolable from the clone Y20 that duplicates the 3' terminal sequence of the nucleic acid sequence of (a), wherein each of said portions of a human genome which is so isolable from the clone Y103 and Y20 respectively is in a relative position in a human genome as shown in FIG.
 1. 5. The polynucleotide of claim 4 wherein the polynucleotide has the restriction pattern and organization shown in FIG.
 1. 6. The polynucleotide of claim 4 wherein the nucleic acid sequence of a portion of a human genome isolable from the clone Y103 comprises the nucleic acid sequences of SEQ ID NOS: 1 through 5, and the nucleic acid sequence of a portion of a human genome isolable from the clone Y20 comprises the nucleic acid sequences of SEQ ID NOS: 1 through
 4. 7. An isolated polynucleotide of a portion of a human genome comprising the following nucleic acid sequences of (1) through (28) in 5' to 3' order: (1) SEQ ID NO: 64; SEQ ID NO:61; (3) SEQ ID NO:59; (4) SEQ ID NO:53; (5) SEQ ID NO:51; (6) SEQ ID NO:49; (7) SEQ ID NO:48; (8) SEQ ID NO:46; (9) SEQ ID NO:45; (10) SEQ ID NO:43; (11) SEQ ID NO:39; (12) SEQ ID NO:35; (13) SEQ ID NO:34; (14) SEQ ID NO:33; (15) SEQ ID NO:31; (16) SEQ ID NO:30; (17) SEQ ID NO:28; (18) SEQ ID NO:26; (19) SEQ ID NO:23; (20) SEQ ID NO:21; (21) SEQ ID NO:20; (22) SEQ ID NO: 18; (23) SEQ ID NO:15; (24) SEQ ID NO:13; (25) SEQ ID NO: 11; (26) SEQ ID NO:9; and (27) SEQ ID NO:8; and (28) SEQ ID NO:7;wherein an intervening nucleic acid sequence appears between each of said adjacent nucleic acid sequences of (1) through (28), said intervening nucleic acid sequence being that found in:a) a yeast artificial chromosome clone which is isolable from a transformant identified by an international deposit number selected from the group consisting of FERM BP-4271, FERM BP-4273, and FERM BP-4274; or b) a cosmid vector clone which is isolable from a transformant identified by an international deposit number selected from the group consisting of FERM BP-4276, FERM BP-4277, FERM BP-4278, and FERM BP-4279.
 8. An isolated polynucleotide of a portion of a human genome comprising the following nucleic acid sequences of (1) through to (5) in 5' to 3' order:(1) SEQ ID NO:5 (2) SEQ ID NO:4; (3) SEQ ID NO:3; (4) SEQ ID NO:2; (5) SEQ ID NO: 1;wherein an intervening nucleic acid sequence appears between each of said adjacent nucleic acid sequences of (1) through (5), said intervening nucleic acid sequence being that found in a yeast artificial chromosome clone which is isolable from a transformant identified by an international deposit number selected from the group consisting of FERM BP-4272 and FERM BP-4275. 